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Resolution-Oriented  Fault  Interrelationships  in 
Combinational  Logic  Networks 

VINOD  K.  AGARWAL  and  GERALD  M.  MASSON 

Abstract — This  correspondence  considers  fault  resolution  as 
a  process  of  applying  a  sequence  of  input  vectors,  called  tests,  to  a 
combinational  logic  network  in  order  to  resolve  an  existing  fault 
situation  from  within  a  given  master  set  of  faults.  A  functional 
approach  based  upon  an  extension  of  the  well-known  Boolean 
difference  concept  to  fault  dependent  situations  is  described.  The 
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Fig.  2.  Comparison  of  execution  times  between  the  focus  number  system  and  an  integer  system. 
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Fig.  3.  Comparison  of  accuracy  among  the  focus,  fixed-point,  and  floating-point 
number  systems. 


raising  a  number  to  a  noninteger  power  is  performed  essentially 
like  a  single  multiplication  in  the  prior  art,  and  the  wide  range 
of  possible  states  can  represent  very  large  or  very  small  answers. 
The  difficult  but  useful  square-root  function  is  extracted  in  focus 
simply  by  shifting  right  1  bit,  then  adding  0  1000.000. 

v.  Comparison  of  Focus  to  the  Prior  Art 

.  Fig.  2  is  a  qualitative  comparison  of  execution  times  between 
the  focus-number  system  and  an  integer  system  requiring  soft¬ 
ware  multiplication.  Exact  execution  times  are  a  function  of  many 
things  (for  example,  inclusion  or  exclusion  of  optional  steps  and 
facileness  of  the  machine  language);  however,  overall  it  is  ex¬ 
pected  that  the  focus-number-system  algorithms  will  perform 
a  mixture  of  calculations  about  five  times  faster  in  a  microcom¬ 
puter  than  a  prior-art  fixed-point  (integer)  system  using  software 
multiplication.  This  estimated  number  does  not  include  the  time 
sayings  expected  by  using  single  instead  of  double  precision,  as 
is  allowed  by  focus  in  certain  applications.  Software  floating¬ 
point  times  are  not  shown,  but  this  would  be  slower  than  fixed- 
point  arithmetic. 

Fig.  3  is  a  quantitative  comparison  of  accuracy  among  the 
focus,  fixed-point  (integer),  and  floating-point  number  systems, 
in  which  all  are  normalized  to  8-bit  bipolar  systems.  The  error 
figure  given  is  the  maximum  relative  error  in  the  region  around 
any  given  number  represented.  This  is  half  the  step  size,  and  is 
v  3  times  the  root-mean  square  error  in  that  region.  In  this  figure 
the  fixed-point  system  is  seen  to  be  the  most  accurate  system  in 
a  narrow  range  between  very  large  and  very  small  numbers.  It  is 
thus  useful  in  processing  a  well-behaved  signal  for  which  the 
virtual  ground  concept  of  focus  control  or  the  wide  range  of  the 
focus  system  are  not  required,  and  where  speed  is  not  important. 
Integer  arithmetic  is  also  superior  in  certain  operations  involving 
chain  addition,  such  as  a  program  counter.  The  floating-point 
system  inefficiently  crowds  states  when  the  mantissa  is  large, 
leaving  sparsely  covered  regions  of  high  error  scattered  across  the 
number  line.  Base  ten  was  chosen  for  comparison.  A  larger  base 
would  result  in  wider  oscillations  and  a  smaller  base  would  result 
in  a  higher  average  error  due  to  redundant  states;  for  example, 
base  two  would  have  a  50-percent  state  redundancy  resulting  in 
an  average  near  the  10-percent  error  line.  Floating  point,  although 
very  slow  without  a  costly  arithmetic  unit,  is  attractive  in  large 


computers  of  over  16  bits  for  which  the  lookup  table  of  focus 
becomes  unpractically  long.  If  a  direct  hardware  implementation 
of  the  focus  table  is  found,  floating-point  arithmetic  will  be  se¬ 
riously  challenged  in  all  applications. 

VI.  Conclusion 

It  is  concluded  that,  for  a  fixed  word  length.  Focus  is  superior 
in  accuracy  to  floating-point  arithmetic  in  all  cases  and  to  fixed 
point  (integer)  in  many  cases,  especially  including  control  systems 
for  which  the  sensing  control  signals  can  be  reduced  by  differ¬ 
ential  inputs  or  other  analog  means  into  forms  that  approach  zero 
as  the  system  approaches  the  desired  controlled  state. 
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T(F»)  =  \x\z\xyM)  9  Z\x,au ]  =  1|.  (1) 

In  words,  for  every  X e  T(FU),  Z\X;YU\  is  different  than  the 
output  of  the  faulty  network,  Z[X;aM],  Thus,  when  any  test 
X e  T(Fm)  is  applied  to  a  network,  if  the  Boolean  output  of  the 
network  is  different  than  the  Boolean  value  of  the  output  func¬ 
tion  describing  the  fault-free  network,  we  say  the  network  has 
failed  the  test;  otherwise  we  say  the  network  has  passed  the 
test. 

Clearly,  the  test  set  T(Fu )  is  based  on  comparing  the  operation 
of  a  faulty  network  with  that  of  a  fault-free  network.  However, 
in  order  to  characterize  the  fault  interrelationships  germane  to 
the  fault  resolution  process,  it  will  be  necessary  to  compare  the 
operations  of  the  network  under  two  fault  conditions.  Thus,  as¬ 
sume  that  fault  condition  Fj  is  known  to  exist  in  N,  and  we  are 
to  determine  a  complete  test  set  for  detecting  F /  in  network  N 
with  Fj  present.  We  will  refer  to  such  a  test  set  as  a  fault  de¬ 
pendent  test  set,  and  denote  it  as  T(F/\Fj).  It  should  be  clear 
that 

T(F,\Fj)  =  \X\Z[X;Y,,aj)  9  Z\X-,a,,aj]  =  lj.  (2) 

Note  that  such  a  test  is  based  on  a  fault  dependent  Boolean 
difference  which  only  gives  valid  results  for  the  network  if,  in¬ 
deed,  this  assumed  fault  Fj  exists  in  the  network.  Also  note  that 
in  applying  a  fault  dependent  test,  the  operation  of  fault-free 
network  is  not  of  concern,  and  the  conclusion  as  to  whether  or  not 
the  network  has  failed  or  passed  the  test  must  take  this  into  ac¬ 
count.  Our  convention  for  this  will  be  the  following.  For  a  test  Xt 
6  T(F,\Fj),  let  the  logical  value  of  the  Boolean  expression  Z\X}; 
Y/,oy]  be  a,  a  e  10, )  }.  Now,  if  the  output  of  the  network  when  this 
test  is  applied  is  5,  we  say  the  network  has  failed  the  test;  oth¬ 
erwise  we  say  the  network  has  passed  the  test. 

A  fundamental  interrelationship  which  exists  between  fault 
dependent  test  sets  and  standard  test  sets  is  described  in  the 
following  theorem. 

Theorem  1: 

TiF,\Fj)  =  T(Fj)  9  T(F,  U  Fj). 

Proof:  Since  Z|X;Y/,Y,/)  ffi  Z\X\Y(,Yj]  =  0,  it  is  clear  from 
(2)  that  T(F/\Fj)  =  \X\{Z\X;Yj,aj]  9  Z\X;a,,aj))  ©  (Z|X; 
Y,.Yj)  9  ZlX-.V/.Yj])  =11. 

However,  by  simply  rearranging  the  above  equation,  we  ob¬ 
tain 

T(F,\Fj)  =  IXKZlX-.Y/.oj]  ©  Z\X;Y,,Yj ]) 

©  (ZlX;a/,«J]®Z[X;Y/,yj))=  I| 
or 

TlF,)Fj)  =  TtFj)  9  T(Fj  Fj).  Q.E.D. 

Example  I To  illustrate  the  above,  consider  the  network  of 
Fig.  1.  Suppose  Af  =  |1,2,3,5|.  Let  1  =  |l,2|and«;  =  |rt,,a2|  =  (1,1}, 
and  let  J  =  |3,5|and  aj  =  lo.-1.q5l  =  }0,1|.  It  can  be  seen  thatZJX; 
Ym]  =  ( Y3X4  +  Y2)Yi  +  (Y3x4+  Y2HY5  +  x4).  Suppose  now  that 
Fj  is  known  to  exist  in  the  network,  and  we  want  to  determine 
whether  F/  also  exists.  By  using  (2),  we  get 

T(F,\Fj)  =  |X|Z(X;Y|,crjl  «  Z\X>a,,aj\  =  1| 

=  |X|I|  I?  =  1}. 

We  now  choose,  say,  X )  =  (0000)  6  T(Ft\Fj)  and  then  determine 
that  Z|X  r,Y/,aj)  =  0.  Now  if  we  apply  Xi  to  the  circuit  con¬ 
taining  Fj,  the  output  would  be  a  I  if  F/  were  present  and  a  0 
otherwise.  Also  note  that  since  T(F/  (J  Fj)  =  |X|xiX2  +  x  jx  2X3X4 
+  X2X3X4  =  1|  and  T(Fj)  =  |X|xjx2X3x4  +  x,x2x3x«  =  1|,  by 
Theorem  1,  we  have  T(F/|Fj)  =  T{Fj)  ©  TiF/  U  Fj)  =  |X(x|X2 
«  1|. 


1  Fif  1  i»  uk*n  from  Ku  »nd  Mawm  |l) 


Fig.  1.  Z\ X;0j  *  zxx3  4  x3x&4  4  x  1X2X3  4  *,x4. 


The  fault  dependent  test  set  will  be  seen  in  the  following  sec¬ 
tion  to  be  fundamental  to  the  characterization  of  the  resolu¬ 
tion-oriented  fault  interrelationships.  Theorem  1  then  indicates 
a  straightforward  means  by  which  standard  testing  information 
can  be  used  to  consider  the  existence  of  such  fault  interrela¬ 
tionships  in  actual  situations.  However,  Theorem  1  also  gives 
significant  insight  into  the  concept  of  a  fault  dependent  test,  as 
it  indicates  that  there  are  really  two  types  of  fault  dependent 
tests:  one  type  of  which  seems  reasonably  satisfying  intuitively, 
the  other  of  which  seems  significantly  less  so.  To  illustrate  this, 
let  Xj  £  T(F/\Fj)  and  let  the  output  of  the  fault-free  network 
with  Xj  applied  to  it  be  ZIXl.Y/.Yj]  =  1.  Now,  we  will  consider 
the  following  two  cases. 

Case  1:  Let  A']  be  such  that  Z{X];Yi,aj ]  =  1  and 
Z|Xi;a/,oj]  =  0. 

Since  X  j  truly  tests  [in  the  sense  of  (1)]  for  F,  (J  Fj  while  not 
testing  for  Fj,  this  test  seems  to  agree  with  what  could  be  natu¬ 
rally  accepted  as  a  fault  dependent  test  for  F/  in  the  presence  of 
Fj. 

Case  2:  Let  X i  be  such  that  Z[X y.Y/.oj]  =  0  and 

Z|Xi;oi,»j]  =  1. 

However,  in  this  case,  since  Xi  is  a  test  which  does  not  take  into 
account  the  effect  of  the  fault  F/  relative  to  the  fault-free  net¬ 
work,  such  a  test  is  intuitively  less  satisfying.  It  is  this  interesting 
aspect  of  a  fault  dependent  test,  however,  which  makes  it  a 
powerful  tool  in  fault  resolution  considerations.  Note  that  by 
Theorem  1,  any  fault  dependent  test  is  aKvavs  one  of  the  above 
two  types. 

IV.  Resolution-Oriented  Fault  Interrelationships 

There  exists  an  inherent  limit  to  the  degree  to  which  a  fault 
situation  can  be  resolved  within  a  master  set,  and  this  limit  de¬ 
pends  upon  the  fault  interrelationships  which  exist  among  the 
faults  within  this  set  and  the  knowledge  regarding  the  existing 
fault  situation  which  has  thus  far  been  attained.  By  means  of  the 
results  of  the  previous  section,  we  are  now  able  to  characterize 
pertinent  fault  interrelationships  which  describe  these  limita¬ 
tions.  To  do  so,  we  will  consider  the  master  set  of  faults  Fm.  *"<1 
then  describe  the  fault  interrelationships  between  subset*  of 
faults  within  this  master  set,  say,  F/  and  Fj.  Note  that  it  is  not 
necessary  that  F;  U  Fj  =  F*f. 

Our  characterizations  of  these  resolution-oriented  fault  in¬ 
terrelationships  will  be  given  by  means  of  the  fault  dependent 
Boolean  difference.  Hence,  we  will  immediately  have  algebraic 
formulas  with  which  to  verify  whether  these  conditions  actually 
exist  within  a  network  for  some  subsets  of  a  given  master  set  of 
faults. 
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Definition  1:  Fj  completely  masks  F/,  denoted  Fj  „  F/.  if  and 
only  if  T(F/IFj)  =  0,  where  0  denotes  the  empty  set 

In  words,  we  say  that  fault  situation  Fj  completely  masks  F/ 
if  and  only  if  it  is  not  possible  to  detect  Fj  when  it  is  known  that 
Fj  exists  in  the  circuit 

Friedman  (7),  Hayes  [8],  Gault  et  al.  |9),  and  Wang  (10]  among 
,  others  have  reported  on  a  masking  relationship  among  faults, 
which  we  will  refer  to  in  the  following  as  classical  masking.  Given 
F/F'i  U  Fj.T(F,).  and  T(F/  U  Fj),  if  for  some  test  X\  a  T(F/), 
we  have  that  Xt  $  T(F/  U  Fj),  it  is  said  that  Fj  classically  masks 
F/  relative  to  the  test  and  if  this  is  true  for  all  X  e  T(Fj)  we  then 
say  that  Fj  classically  masks  F/.  Clearly,  this  is  not  equivalent 
to  saying  that  Fj  completely  masks  F/,  since  it  is  still  possible 
that  the  fault  conditions  Fj  and  F]  tj  Fj  can  be  distinguished. 
Since  classical  masking  describes  a  situation  where  tests  for  Fj 
are  rendered  useless  by  the  addition  of  Fj  to  the  fault  condition, 
classical  masking  might  be  more  accurately  described  as  test  set 
nullifying,  and  in  the  following  we  will  denote  this  classical 
masking  relationship  as  Fj  Jf  F/. 

The  concept  of  complete  masking  has  also  been  suggested  by 
Cha  and  Metze  (12].  They,  however,  choose  to  refer  to  it  as  m- 
covering.  They  then  coup le  this  m  -covering  concept  with  test  set 
nullifying,  and  refer  to  that  resulting  concept  as  masking. 

Example  2:  To  illustrate  the  above,  consider  the  network  of 
Fig.  2.  Let  I  =  (5,6|  and  a/  =  Jas.aet  =  {0,1},  and  let  J  =  |13|  and 
oj  =  lanl  =  !0|.  It  can  be  seen  that  T{F/)  =  JAT (x,aro  ©  x^xj  =  lj 
and  T(F,  U  Fj)  «  |X|iix2  ©  xax.,  =  1|.  Since  T(F ,)  D  T(F,  U, 
Fj)  =  0,  it  is  clear  that  Fj  Jj  Fj.  However,  by  Definition  1, 
T(Fj\Fj)  =  \X\xii2Xyi4  =  1]  X  0.  Hence  Fj  jtjFi-  Thus,  if  it  were 
known  that  Fj  existed  in  the  network  of  Fig.  2,  it  would  still  be 
possible  to  determine  whether  F;  also  existed.  For  example,  if  we 
applied  the  fault  dependent  test  X  =  (1100)  to  the  network 
containing  Fj,  the  output  would  be  a  0  if  F/  also  were  present, 
and  a  1  otherwise.  It  should  be  noted  that  the  fact  that  Fj  F/ 
does  not  necessarily  imply  that  F,  Fj.  Hence,  although  the 
concept  of  complete  masking  is  an  indication  of  a  limit  to  which 
•  resolution  can  be  achieved  with  algebraic  terminal  experiments, 
it  should  also  now  be  clear  that  any  such  resolution  testing  must 
take  into  account  more  than  just  one-way  masking  interrela¬ 
tionships  before  definite  conclusions  concerning  the  existing  fault 
situation  within  the  master  set  can  be  made.  The  next  fault  in¬ 
terrelationship  to  be  described  moves  further  in  that  direc¬ 
tion. 

Definition  2:  F;  is  unresolvable  from  Fj,  denoted  F/  Jf  Fj,  if 
and  only  if  T(F,\Fj)  =  T(Fj\F,)  =  0. 

In  words,  we  say  that  fault  situations  Fy  and  Fj  are  unresolv¬ 
able  if  it  is  not  possible  to  detect  F/  in  the  presence  of  Fj  and  if 
it  is  not  possible  to  detect  Fj  in  the  presence  of  F/. 

The  concept  of  unresolvability  might  seem  to  be  similar  to 
McCluskev  and  Clegg’s  concept  of  functional  fault  equivalence 
(11).  The  differences  between  the  two  concepts  are  a  result  of  the 
standard  testing  and  fault  detection  (to  which  functional 
equivalence  relates)  and  fault  dependent.testing  and  fault  reso¬ 
lution  (to  which  unresolvability  relates)  considerations,  as  will 
now  be  shown.  Given  the  faults  Fj  and  Fj  if  Z\X;a/,Yj)  = 
Z(A’;F/,aj|  it  is  said  that  the  two  faults  are  functionally  equiv¬ 
alent,  denoted  F/  ~  Fj  It  is  clear  that  F/  ~  Fj  if  and  only  if  T(F/) 
=  T(Fj)  But  using  the  definition  of  unresolvability  and  Theorem 
1,  we  have  that  Fj  %  Fj  implies  that 

r(F/|F,)  =  T(Fj\F,)  =  0 
or 

T(F/)  ©  TiF,  U  Fj)  =  T(Fj)  ©  T(F,  U  F j)  =  0 
which  implies 

T(F/)  =  TiFj)  =  T(F/  U  Fj). 

Thus,  the  functional  equivalence  of  two  fault  situations  is  a 


necessary  but  not  sufficient  condition  for  the  unresolvability  of 
those  fault  situations. 

Example  32:  To  illustrate  the  above,  consider  the  circuit  of  Fig. 
3,  and  let  M  =  |1,2,5,6|,  /  =  {2,6},  J  =  {1,5},  a,  =  |0,1|,  and  aj  =  |1,0|. 
Since  Z\X\ai,Yj)  =  Z\X\Yi,aj\  =  *2X3,  we  have  thatF/  ~Fj. 
However,  note  that  T(Fj  |F;)  =  |X  IX2X3  =  1  j,  and  therefore,  these 
two  double  faults  are  resolvable.  In  other  words,  if  it  were  known 
that  the  fault  F/  were  in  the  circuit,  then  if  we  applied  X 1  = 
(0110)  to  the  circuit,  the  output  would  be  a  0  if  Fj  also  existed  and 
a  1  otherwise. 

In  the  remainder  of  this  section,  we  will  give  some  useful  re¬ 
marks  involving  unresolvability,  functional  equivalence,  complete 
masking,  classical  masking  (test  nullifying),  and  fault  dependent 
testing.  These  remarks  are  intended  to  make  more  salient  certain 
important  features  of  our  fault  interrelationships. 

Remark  1:  Given  the  three  fault  situations  F/,  Fj,  and  F*: 

a )  Fj  ~  F/  U  Fj  if  and  only  if  Fj  %  Fi  U  Fj. 

b)  T(F,\F,  U  Fj)  =  WjIF/  u  Fj)  =  0. 

c)  T(F,UFj\Fk)  =  T(Fj\Fk)  ©  T(F,\Fj  U  FH). 

d)  TtF/IFj)  ©  T(Fj\F,)  =  T(F,)  ©  T(Fj). 

The  proofs  of  these  follow  directly  from  Theorem  1  and  the 
definitions  of  this  section.  Remark  la)  gives  the  conditions  under 
Which  functional  equivalence  &nd  unresolvability  are  identical 
interrelationships.  In  other  words,  if  two  fault  situations  are 
equivalent,  and  one  of  these  fault  situations  is  a  subset  of  the 
other,  then  those  two  fault  situations  are  unresolvable.  Remark 
lb)  asserts  that  it  is  not  possible  to  test  for  a  subset  of  a  specified 
fault  situation  when  it  is  assumed  that,  indeed,  the  entire  speci¬ 
fied  fault  situation  exists  in  the  circuit.  Remark  lc)  is  useful  in 
simplifying  expressions  involving  fault  dependent  tests  such  that 
all  sucb  tests  are  in  the  form  of  (2)  wherein  the  assumed  fault 
situation  and  the  tested  for  situation  are  disjoint.  For  example, 
to  determine  T(F;  U  Fj\Fj  U  Fj_)  we  can  use  this  Remark  lc). 
Letting  K  =  J  U  L,  and  noting  that 

T(F,  U  FjIFj  U  Fi)  =  T(Fj\Fj  U  FL)  ©  T(F,\Fj  U  FL) 

=  T(F,\FjVFl), 

since,  by  Remark  lb),  T(Fj|Fi  U  Fi)  =  0.  Finally,  Remark  Id) 
indicates  a  relationship  which  exists  between  fault  dependent 
test  sets  and  the  composing  fault  test  sets  which  does  not  involve 
the  composite  fault  test  set.  This  Remark  Id)  makes  apparent 
an  interesting  property  of  redundancy.  Suppose,  for  example, 
that  F;  and  Fj  are  redundant  fault  situations.  Then  T(F/)  = 
T(Fj)  =  0.  By  Remark  Id)  this  means  that  T(F;  |Fj)  =  F(Fj|F; ), 
and,  if  these  two  fault  dependent  test  sets  are  not  empty,  the 
composing,  redundant  faults  are  resolvable  in  that  it  is  possible 
(with  the  same  tests)  to  detect  either  of  the  redundant  faults  in 
the  presence  of  the  other.  This  is,  then,  an  algebraic  assertion  of 
a  fault  phenomena  first  noted  by  Friedman  |7]. 

We  will  now  consider  the  connection  between  complete 
masking  and  classical  masking  or  test  set  nullifying.  We  know 
that  if  Fj  jy  F/,  then  T(F/)  O  T(F )  U  Fj)  =  0.  But  from  this  it 
is  clear  that  T(Fi)  ©  T(Fj  U  Fj)  7*  0.  Hence,  we  have  that  Fj  £ 
F;  implies  that  F;  Fj.  In  other  words,  if  Fj  classically  masks 
Fi,  then  F/  does  not  completely  mask  Fj.  This  can  be  extended 
further  by  noting  that  if  we  have  that  F;  ff  Fj  and  Fj  *  Ft,  then 
clearly  F;  r  Fj 

The  above  result  significantly  indicates  the  difference  of  the 
fault  interrelationships  which  are  pertinent  to  fault  detection  and 
fault  resolution.  In  terms  of  fault  detection  if  two  fault  situations 
classically  mask  each  other,  they  form  what  is  referred  to  as  an 
undetectable  fault  set  |9)  in  that  the  composite  fault  situation 
cannot  be  detected  with  tests  from  the  test  sets  for  either  of  the 
composing  fault  situations.  However,  we  have  seen  here  that  in 


*  Fir  3  >nd  Example  3  are  taken  from  Cha  and  Meue  |12). 
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Fig.  3.  Z|X:9)  ■  xiijij  + 


terms  of  fault  resolution,  these  two  fault  situations  are  then  re¬ 
solvable.  This  is  summarized  in  the  following. 

Remark  2: 

a)  Ft  s  Fj  and  Fj  *  Fi  implies  F/  *  Fj. 

b)  Fi  a  Fj  and  Fj  „  F,  implies  F/  ?  Fj. 

V.  Fault  Interrelationship  Ramifications 

Whenever  a  test,  standard  or  fault  dependent,  is  applied  to  a 
network,  the  information  contained  in  the  outcome  of  that  test 
is,  in  general,  quite  limited  in  terms  of  fault  resolution.  For  ex¬ 
ample,  t  we  determine,  say,  T(F , )  for  F/,  apply  a  Xi  €  T(Ft )  to 
the  network,  and  observe  that  the  network  fails  the  test,  we  can 
only  conclude  that,  indeed,  the  network  is  faulty.  Similarly,  if  the 
network  passes  the  test,  we  can  only  conclude  that  Fj  is  not 
presenL  Analogous  statements  could  be  made  for  fault  dependent 
testing.  Therefore,  in  order  to  looale-or  resolve  the  existing  fault 
in  a  network,  a  sequence  of  tests  must  necessarily  be  applied. 
However,  even  with  the  application  of  a  sequence  of  tests,  a  point 
will  be  reached  where  further  resolution  is  not  possible.  We  are 
considering  in  this  correspondence  an  approach  to  fault  resolu¬ 
tion  based  on  algebraic  terminal  experiments.  For  a  set  of  master 
faults  in  a  network  and  using  this  approach,  the  limit  of  resolution 
which  can  be  attained  is  dependent  upon  the  type  of  test  used  and 
the  existing  fault  interrelationships  within  the  master  set.  It 
should  be  clear  that  higher  resolution  is  possible  if  fault  depen¬ 
dent  testing  is  used  instead  of  standard  testing,  since  fault  de¬ 
pendent  testing  takes  into  account  fault  situation  information 
not  utilized  by  standard  tests.  In  the  following  we  will  specify  the 
limits  of  resolution  attainable  with  fault  dependent  testing  in 
terms  of  the  fault  interrelationships  which  were  characterized 
in  the  previous  section.  It  will  be  seen  that  in  terms  of  resolution 
limits:  unresolvability  is  to  fault  dependent  testing  as  functional 
equivalence  is  to  standard  testing. 

Suppose  that  in  a  network,  we  have  that  the  existing  fault  in 
either  Ft,  Fj.  or  F/  U  Fj.  Now,  suppose  further  that  F;  jf  Fj.  We 
recall  from  the  previous  section  that  this  implies  that  T(F/)  = 
TiFj )  =  T(F/  U  Fj).  This  means,  then,  that  if  we  applied  stan¬ 
dard  tests  for  Fi,  Fj,  or  Fi  U  Fj,  we  could  not  resolve  the  fault 
situation.  However,  suppose  further  that  by  some  means  it  were 
possible  to  either  inject  F /  in  the  network  or  sense  that,  indeed, 
Fi  existed  in  the  network,  or  to  inject  Fj  in  the  network  or  sense 
that,  indeed,  Fj  existed  in  the  network.  Regardless,  it  would  still 
not  be  possible  even  with  fault  dependent  testing  to  resolve  the 
fault  situation  further  since  Ft  ?  Fj.  However,  if  we  instead 
suppose  that  Ft  Fj,  but  where  F/  #  Fj.  and  we  again  had  some 
means  of  externally  probing  or  sensing  conditions  in  the  network 
on  the  lines  of  I  or  J,  then  in  this  case  it  may  be  possible,  in  gen¬ 
eral,  to  further  resolve  the  existing  fault  situation  since  the  im¬ 
plication  of  functional  equivalence  is  simply  that  T(Fj )  =  T(Fj) 


i*  T(Fj  U  Fj).  Hence,  unresolvabilily  can  be  considered  to  be 
an  interrelationship  among  faults  which  limits  the  degree  of 
resolution  which  can  be  attained  when  testing  is  performed  with 
more  information  regarding  the  existing  fault  situation  than  is 
generally  used  with  standard  test  set  approaches.  Hence,  the 
parallel  between  functional  equivalence/standard  testing  and 
unresolvability/fault  dependent  testing  is  clear. 

It  might  be  noted  here  that,  indeed,  if  we  did  have  some  ex¬ 
ternal  means  of  injecting  faults  into  the  network  on,  say,  some 
lines  K  of  the  circuit,  and  we  had  the  case  that  Fi  1  Fj,  but  that 
Fy  u  Fj,  and  Fy  «  F/,  then  we  could  resolve  the  fault  situation 
further.  As  an  illustration  of  this,  consider  the  following. 

Example  4:  It  can  be  observed  in  the  network  of  Fig.  1  that  if 
/  =  (5|  and  o/  =  I05I  =  11),  and  J  =  |9j  and  oj  ~  |a9|  =  |1|,  then  Fj 
H  Fj.  Thus,  even  if  it  were  possible  to  inject  one  of  these  two 
faults,  say,  Fj,  it  is  easily  seen  here  that  it  still  would  not  be 
possible  to  determine  whether  Fi  also  existed  in  the  network. 
However,  if  we  could  inject  some  other  fault,  say,  Fk,  K  =  |13|, 
ok  =  I013I,  =  101.  then  since  F*  Fj,  and  Fy  £  Fj,  applying  the 
input  combination  (1111)  e  T{Fj\Fk )  we  can  now  test  for  the 
fault  F/. 

A  complete  treatment  of  this  idea  of  enhancing  resolvability 
by  means  of  external  lines  to  the  network  which  can  be  used  to 
inject  fault  situations  is  beyond  the  scope  of  this  paper.  However, 
it  seems  that  the  functional  approach  to  fault  interrelationships 
being  utilized  in  this  correspondence  makes  clear  the  criteria 
which  must  be  satisfied  for  the  design  of  highly  resolvable  net¬ 
works. 


VI.  CONCLUSION 

In  conclusion,  we  have  presented  in  this  correspondence  a 
characterization  of  fault  interrelationships  which  limit  the  res¬ 
olution  of  a  fault  situation  from  within  a  master  set  of  faults  when 
we  have  certain  information  about  that  fault  situation  which 
allows  us  to  do  fault  dependent  testing  instead  of  standard 
testing.  This  has  been  done  by  means  of  an  extension  of  the 
Boolean  difference  to  fault  dependent  situations.  It  should  be 
clear  that  unresolvability,  the  main  interrelationship  of  this 
characterization,  is  really  only  another  point  on  a  spectrum  of 
fault  interrelationships  which  is  increasingly  being  seen  to  be  of 
significance  in  fault  analysis.  Certainly,  this  spectrum  will  be 
worthy  of  future  research. 
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which  is  tested  by  at  least  t  other  processors,  and  at  most  t  of  which 
are  faulty.  The  existing  fault  situation  is  always  diagnosed  if 
»  ^  2t  +  I  and,  in  some  cases,  can  still  be  diagnosed  if  n  <  2t  +  1. 
The  implementation  of  the  algorithm  is  straightforward  and  suitable 
for  microprocessor  applications. 

Index  Terms—  Diagnosis  algorithm,  fault  syndromes,  micro¬ 
processor,  modular  networks. 

Introduction 

Consider  a  general  model  of  a  multiple  processor  architecture 
consisting  of  n  digital  modules  denoted  V0.  Vu  U.-t  and 
some  associated  interconnection  design  denoted  DtJ.  These  mod¬ 
ules,  for  example,  could  be  n  processors  implementing  a  seg¬ 
mented  algorithm  [6],  Regardless  of  the  use  of  the  multiple 
processor  architecture,  we  will  assume  that  each  Vt  is  capable  of 
testing  the  other  V  J s  to  which  it  is  directly  connected  for  some 
specified  class  of  faults.  If  a  module  contains  any  such  fault,  we 
will  refer  to  it  as  faulty.  The  problem  we  will  study  in  this  paper  is 
the  diagnosis  of  an  existing  fault  situation  among  the  modules 
given  their  respective  testing  results.  This  problem  is  not  new  and 
has  been  examined  elsewhere  in  the  literature  [1],  [3],  [4],  [S],  [7], 
[8J.  The  results  to  be  presented  here  represent  a  new  approach  to 
such  diagnosis.  In  particular,  the  diagnosis  procedure  described 
will  be  seen  to  be  sufficiently  straightforward  to  be  easily  im- 
plementable  on  a  simple  processor,  e.g.,  a  microprocessor,  and  for 
a  proper  interconnection  design  among  the  processors  and  upper 
bound  on  the  number  of  simultaneous  faults  which  can  occur,  will 
always  yield  the  correct  diagnosis  of  the  existing  fault  situation. 

Preliminaries 

Given  n  modules  V0.  Vt,  -,Vm.  ,,  we  will  denote  the  modules 
which  Vt  tests  by  U  fir4„  r  «  1,  2,  •  •  • ,  t  where  /(r,  i)  e  [0,  !,•••, 
n.  —  1],  i  =  0, 1,  •  ••,  n  —  1.  For  convenience,  we  will  always  assume 
that  V,  tests  itself  and,  regardless  of  its  state,  concludes  that  it  is 
fault  free.  The  outcome  of  the  test  of  module  by  module  U, 
will  be  denoted  a(i,/(r,  i))  where 

..  ,,  |0.  if  V,  concludes  that  V n, ,,  is  fault  free 

a(,’/(r’,))=|l,  otherwise. 

It  should  be  noted  that  the  conclusion  of,  say,  V,  regarding  the 
state  (faulty  or  fault  free)  of  the  modules  to  which  it  is  connected  is 
only  reliable  if  indeed  U,  is  fault  free.  If  with  each  module  Vt  we 
associate  a  test  table  B,,  i  =  0, 1,  •  •  • ,  n  -  1,  where  B,  represents  the 
conclusion  of  U,  regarding  the  states  of  all  the  modules,  we  have 
the  problem  of  determining  the  existing  fault  situation  based  on 
the  available  test  results.  Whether  or  not  this  is  feasible  clearly 
depends  on  the  number  of  faults  and  the  interconnection  design. 
We  will  assume  in  the  following  that  at  most  t  modules  can  be 
simultaneously  faulty  and  that  every  module  is  tested  by  at  least  f 
other  modules.  Under  some  assumptions  on  the  interconnection 
design,  Preparata,  Metze,  and  Chien  (7]  have  shown  that  it  is 
feasible  to  diagnose  any  valid  fault  situation  However,  the  diag¬ 
nosis  algorithms  which  have  been  proposed  to  do  so  are  quite 
complex  [1],  [4],  [5).  We  propose  here  a  new  diagnosis  algorithm 
for  this  problem.  For  the  purpose  of  explanation  we  will  assume  in 
the  following  that  the  interconnection  design  between  the  mod¬ 
ules  is  the  so-called  Dt  J  design  of  [7J.  wherein  there  is  a  testing 
interconnection  from  V,  to  V ,  if  and  only  if;  -  i  *=  m  (modulo  n) 
and  m  assumes  the  values  from  I  to  r.  The  results  presented  here 
have  been  extended  to  more  general  interconnection  designs,  but 


since  they  are  descriptively  cumbersome,  these  extensions  will  not 
be  detailed. 

Diagnosis  Algorithm 

Each  test  table  Bf  has  components  Bli0,  Bl-t,  •••.  B,._ ,  where 
Bi  j  represents  the  conclusion  of  module  Ut  regarding  the  state  of 
module  Uj.  If  module  Vt  ‘‘believes"  that  module  V  t  is  fault-free, 
then  B,j  is  set  to  the  value  0;  otherwise,  B,j  is  set  to  the  value  I. 
Suppose  that  B0,  B,,  •■,  B„_ ,  are  complete  in  the  sense  that  every 
module  has  a  conclusion  regarding  the  state  of  each  of  the  mod¬ 
ules  Vi.  i  =  0, 1,  ■■■,n  —  1.  We  will  assume  here  that  if  a  module  is 
fault-free,  its  corresponding  table  is  correct. 

Lemma  I :  There  exists  at  least  n  —  t  of  the  B,  tables  which  are 
identical. 

Proof:  Since  at  most  r  modules  are  faulty,  and  since  a  table 
corresponding  to  a  fault-free  module  correctly  describes  the  fault 
situation,  the  theorem  follows. 

Lemma  2:  If  there  exists  only  one  set  of  identical  tables  B,IU, 
Bniy,  •  Bi  such  that  s  >  n  -  t,  then  each  of  these  tables  in  this 
set  correctly  describes  the  existing  fault  situation. 

Proof:  We  already  know  that  there  exists  at  least  n  —  t 
correct,  and  therefore  identical,  tables.  Therefore,  if  only  one  set  of 
identical  tables  has  a  cardinality  larger  than  or  equal  ton  -  t,  this 
set  must  consist  of  the  correct  tables. 

It  should  be  clear  that  no  conclusion  can  be  made  regarding  the 
fault  situation  if  there  exists  more  than  one  set  of  identical  tables 
with  cardinality  larger  than  or  equal  to  n  —  f. 

Theorem  1 :  Suppose  that  n  >  2r  +  1 ;  then  there  exists  one  and 
only  one  set  of  identical  tables  with  cardinality  larger  than  or 
equal  to  n  -  t. 

Proof:  Suppose  that  n  ^  2t  +  1  and  assume  that  there  exist 
two  sets  of  identical  tables  of  cardinality  n,  and  n2.  respectively. 

Assume 

n,  >  n  -  t 
and 

n2  >  n  -  t. 

Then 

n,  +  n2  >  2n  -  2t. 

We  know  that 

n  >  n,  +  rij 

and  therefore 

n  >  2n  —  2 1. 

This  inequality,  used  in  conjunction  with  n  >  2r  +  1.  yields 

2n  >  2n  +  1 

and  we  conclude  that  we  cannot  have  two  sets  of  identical  tables 
of  cardinality  larger  than  or  equal  to  n  -  t  when  n  >  2/  +  I. 

At  this  point  we  need  an  efficient  procedure  to  build  the  com¬ 
plete  n  tables  B0.  B|.  ■  •  B,_  i  such  that  if  module  U,  is  fault-free, 

then  the  table  B,  reflects  accurately  the  fault  situation  of  the  mul¬ 
tiple  processor  architecture.  Such  an  algorithm  is  presented  in  the 
following  to  compute  the  tables  B0,  B,.  -  •-,  B,.,. 

Algorithm  I :  Let  i  in  (0, 1,  ■  •  • .  n  -  I]  and  r  in  [1. 2,  ■  •  • .  n  —  1]  be 
given. 
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Step  0:  Set  fi,  ,  =  0  for  m  =  0,  1.  ,  n  -  1.  set  j  =  i,  set 

k  =  i  +  I.  and  set  Nf  =  0. 

Step  I :  If  Nf  >  t,  stop:  else,  go  to  Step  2 
Step  2:  If  k  =  i,  stop;  else,  go  to  Step  3 
Step  J:  If  a(j.  k )  =  1.  set  B,t  =  I.  set  Nr  =  Nr  +  I.  and  goto 
Step  4:  else,  set  j  =  k  and  go  to  Step  4 

Step  4:  Set  k  -  k  +  I  and  go  to  Step  1. 

Notes: 

1)  All  additions  are  performed  modulo  n. 

2)  We  assume  a  D, ,  interconnection  design,  i.e.,/(r,  j)  =  j  +  r 
(modulo  n),  and  therefore,  we  use  the  notation  a(j,j  +  r)  instead 
of  the  more  general  notation  a(j,f(r,j)). 

Theorem  2:  If  a  D, ,  interconnection  design  is  used,  if  the  max¬ 
imum  number  of  faults  which  may  occur  is  f,  and  if  module  12,  is 
fault-free,  then  the  table  B,  constructed  by  the  algorithm  ac¬ 
curately  reflects  the  existing  fault  situation. 

Proof:  We  need  to  show  that  the  algorithm  is  well  defined 
and  that  it  produces  tables  B,  which  are  correct  whenever  U,  is  not 
faulty.  The  technique  we  use  to  prove  the  theorem  is  based  on  the 
use  of  invariant  assertions  as  described  in  [2]  (see  Fig.  1 ). 

We  assume  that  a  D,.,  interconnection  design  is  used,  i.e„ 
module  (2,  tests  the  modules  f2,,  ,,  12,,  2,  ■  ■  .  12,,,.  The  algorithm 
uses  the  quantity  a(j.  k)  which  contains  the  result  of  the  test  of 
module  k  by  module  j.  It  follows  that  the  algorithm  is  well  defined 
if  and  only  if  j  and  k  are  related  by 

k  =  j  +  r 

where  r  is  some  integer  in  (1.  2,  ■  • ,  f]. 

Assume  that  before  executing  Step  3.  the  following  assertion 
holds: 

(Al)  j+\<k<j+\  +  Nr. 

Then  it  can  be  shown  that  (A  1)  still  holds  after  the  execution  of 
Step  4  Clearly.  (Al)  is  satisfied  by  the  initial  values  given  to  j,  k. 


and  Nr,  and  therefore  we  conclude  that  (Al)  is  always  satisfied 
before  the  execution  of  Step  3. 

It  is  only  possible  to  reach  Step  3  if 

Nf  <  t. 

It  follows  that  just  before  the  execution  of  Step  3.  the  quantities  j 
and  k  are  related  by  the  assertion 

(A2)  j  +  I  <  k  <  j  +  t 

which  shows  that  the  algorithm  is  well  defined. 

The  first  part  of  the  proof  showed  that  the  algorithm  is  well 
defined.  We  now  prove  that  if  Ut  is  fault-free,  then  the  table  B, 
reflects  the  actual  fault  situation.  Following  again  the  approach 
described  in  [2],  we  show  that  the  following  assertions  are  always 
satisfied  before  the  execution  of  Step  3. 

(A3)  The  module  12,  is  not  faulty. 

(A4)  B,  accurately  reflects  the  existing  fault  situation  up  to 
k  —  1,  i.e.,  for  all  m  in  [i.  i  +  1.  i  +  2,  k  -  1]: 

Bf„  =  0  if  and  only  if  module  Um  is  not  faulty 

and 

B,.„  =  1  if  and  only  if  module  Um  is  faulty. 

(A5)  Nr  contains  the  number  of  faulty  modules  up  to  it  -  l.i.e.. 

Nr  =  ‘l  Bim 

m  *  t 

It  can  be  shown  that  if  (A3),  (A4),  and  (A5)  are  true  before  the 
execution  of  Step  3.  then  they  are  still  true  after  the  execution  of 
Step  4.  Clearly,  (A3),  (A4),  and  (A5)  hold  after  the  execution  of 
Step  0.  and  therefore  we  conclude  that  (A3),  (A4),  and  (A5)  are 
always  true  before  the  execution  of  Step  3. 

Now,  suppose  that  the  algorithm  stops  in  Step  1.  We  know  that 
B,  is  correct  up  to  k  and  that  Nr  =  t.  In  other  words.  B,  m  correctly 
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Fir  2  P,  j  interconnection  of  nine  modules;  modules  V l/j.  and  (J6  faulty 


reflects  the  fault  situation  for  m  =  i,  i  +  1,  ■  •  • ,  k  and  t  faults  have 
been  detected.  But  we  have  assumed  that  at  most  t  faults  may 
occur,  and  therefore  this  implies  that  the  remaining  modules  are 
,no!  faulty.  The  B,.„  for  m  =  k  +  1,  k  +  2.  ■■■ .  i  —  1  are  equal  to  0. 
and  therefore  the  complete  table  B,  is  correct. 

Suppose  that  the  algorithm  instead  stops  in  Step  2;  then  B,  is 
correct  up  to  k  =  i,  and  therefore  B,  is  correct. 

Although  we  have  shown  that  when  the  algorithm  stops,  it 
produces  the  correct  table,  it  remains  to  be  shown  that  it  indeed 
stops  after  a  finite  number  of  iterations.  We  note  that  k  takes  the 
values  i,  i  +  1,  i  +  2,  •  •  ■ ,  and  therefore  if  the  algorithm  does  not 
stop  in  Step  1,  it  must  necessarily  stop  in  Step  2.  This  concludes 
the  proof  of  the  theorem. 

Note  that  table  B,  does  not  contain  any  don’t  care  entries. 
This  occurs  because  table  B,  is  constructed  by  using  not  only  the 
test  results  of  module  U but  also  the  test  results  of  other  mod¬ 
ules.  For  example,  if  f/,  does  not  test  directly  Uj.  but  U,  tests  t/k 
and  finds  Ut  correct,  then  U,  accepts  the  conclusion  of  U*  regard¬ 
ing  Uj . 

Accelerated  Algorithm 

The  diagnosis  of  the  set  of  faulty  modules  based  on  the  results 
of  Lemmas  1  and  2  and  Theorem  1  requires  that  the  table  B„ 
i  =  0,  1,  ■  •  • ,  n  —  1  be  compared.  This  process  is  time  consuming 
and  may  be  avoided.  For  each  ;  =  0,  I —  1.  let  y,  be  the 
number  of  indices  i  for  which  B,,;  =  1.  i.e., 

■/■j  =  cardinality  of  {i  <=  [0,  1,  •  ••,  n  -  lj  j  B,j  =  I}, 

then  these  quantities  may  be  used  in  a  diagnostic  algorithm  as 
follows. 

Algorithm  2:  Let  f  in  [1,  n  -  1]  be  given. 

StepO:  Compute  the  tables  B0.  B,.  ■.  B._,  by  using  Algo¬ 
rithm  1. 

Step  I :  Compute  the  quantities  y0.  Vi.  "  , 1 


Step  2:  Let  V  =  \j  e  [0,  1,  ••  • ,  n  —  1  ]  ( */j  >  t  +  1J 

Note  that  if  the  tables  B,  are  arranged  to  form  an  n  x  n  array 
where  B,  is  the  iih  row.  then  yt  is  just  the  number  of  l's  in  column 
) 

Theorem  3:  If  a  D,  ,  interconnection  design  is  used,  if  the  max¬ 
imum  number  of  faults  which  may  occur  is  t.  and  if  n  a  2t  +  1. 
then  Uj  is  faulty  if  and  only  if  j  is  in  F. 

Proof:  The  result  is  a  direct  consequence  of  Lemmas  1  and  2 
and  Theorems  1  and  2. 

Algorithm  2  is  well  suited  for  implementation  on  a  micro¬ 
processor.  For  example,  on  an  Intel  8080  microprocessor,  the 
total  amount  of  memory  necessary  to  store  the  data  and  the  pro¬ 
gram  in  the  case  n  =  8  and  t  =  2  is  176  words  of  8  bits.  i.e..  1408 
bits. 

We  note  that  Algorithm  2  may  be  implemented  in  parallel  on  a 
network  of  N  microprocessors  with  N  <  n.  In  particular,  if  N 
microprocessors  are  used,  then  it  is  possible  to  compute  in  parallel 
all  the  tables  B,  and  all  the  quantities  ■; r  The  computational  time 
necessary  to  diagnose  the  network  of  n  modules  using  N  micro¬ 
processors  for  implementing  Algorithm  2  is  essentially  T|n/N]/n 
where  T  is  the  computational  time  necessary  to  execute  the  in¬ 
structions  of  Algorithm  2  when  a  single  microprocessor  is  used 
and  [n/N]  is  the  smallest  integer  larger  than  n!N.  The  simplicity  of 
Algorithm  2  gives  a  large  amount  of  flexibility  to  its  implementa¬ 
tion.  For  example,  one  could  construct  a  special  very  reliable 
arbitrator  device  which  would  receive  all  the  a(i.  J)' s  from  the 
modules  and  then  would  generate  the  diagnosis  of  the  system 
using  Algorithm  2.  The  high  reliability  of  the  arbitrator  could  be 
justified  because  of  the  simplicity  of  its  required  operation  relative 
to  the  complexity  of  the  modules  in  the  network.  Alternatively,  it 
would  be  possible  to  send  the  a{i,j)' s  to  each  module,  which  could 
in  turn  compute  the  tables  B,.  In  this  case,  each  module  could  then 
decide  for  itself  which  other  modules  are  faulty  and  act  ac¬ 
cordingly.  Clearly,  many  other  variations  for  implementation  are 
possible 
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Example 

In  order  to  demonstrate  the  simplicity  of  the  algorithms,  we 
apply  them  to  the  network  given  in  Fig.  2.  The  network  contains 
n  =  9  modules,  l  =  3.  i.e.,  at  most  three  modules  may  be  faulty, 
and  a  D ,.j  interconnection  design  is  used,  i.e.,  module  \J0  tests  V 
L  2.  and  V j.  module  U,  tests  V  2.  U3.  and  Ut,  etc.  Assume  that 
the  modules  L',.  Vs,  and  LI*  are  faulty  Fig.  3  contains  a  possible 
set  of  test  outcomes  The  application  of  the  algorithm  to  these  lest 
outcomes  yields  the  tables  B,.  i  =  0.  1.  2.  ■  ,  8  given  in  Fig.  4  We 
find  that  the  tables  B0.  B2.  B4.  Bf,  B7,  and  B8  are  identical  We 
have  six  identical  tables,  and  using  Lemma  2.  we  conclude  that 
these  tables  reflect  the  correct  fault  situation  of  the  network,  i.e., 
wc  conclude  that  the  modules  U V s.  and  V*  are  faulty.  Alterna¬ 
tively,  we  may  compute  the  quantities  i.e  .  y«  =  0,  yt  =  8, 
y2  =  0.  ,  j  =  8. ;  4  =  0,  y,  =  0,  y6  =  8.  y7  =  1.  and  yB  =  1,  and  then 
compute  the  set  V  =  [/ 1  yy  >4)  =  jl.  3,  6J  Using  Theorem  3. 
we  conclude  once  again  that  L'i.  Uj.  and  V*  are  faulty. 

CONCLISION 

An  approach  to  the  problem  of  fault  diagnosis  of  symmetric 
multiple  processor  architectures  has  been  proposed  It  consists  of 
constructing  tables  B„  assuming  that  the  corresponding  modules 
L,  are  not  faulty,  followed  by  a  voting  procedure  The  construc¬ 
tion  of  the  tables  B,  is  decoupled  in  the  sense  that  each  table  may 
be  constructed  independently  of  the  others  It  is  possible  to 
decrease  the  amount  of  computation  necessary  to  obtain  all  the 
tables  B,.  i  =  0.  1.  -  .  n  -  I  by  increasing  the  dependency  be¬ 


tween  the  construction  of  the  various  tables.  It  is  not  difficult  to 
find  schemes  in  which  the  construction  of  the  table  Vt  depends  on 
ihc  tables  V0.  U ,.  •••,  Uj . ,.  Such  schemes  are  more  complicated 
to  code  than  the  one  we  propose,  require  more  memory  to  store 
the  program,  and  do  not  lend  themselves  to  parallel  implementa¬ 
tion  Therefore,  we  feel  that  our  scheme,  i.e..  Algorithm  2  which 
has  a  lime  complexity  of  0(n2)  if  sequentially  implemented  and 
0(n)  if  implemented  on  a  network  of  n  microprocessor,  is  ideally 
suited  for  the  fault  diagnostic  of  D, ,  networks. 
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Recursive  Coverage  Projection  of  Test  Sets 

V1NOD  K.  AGARWAL  and  GERALD  M.  MASSON 


Abstract — In  the  generation  of  test  sets  for  the  detection  of 
stuck-type  faults  in  combinational  switching  networks,  it  is  an 
expedient  and  reasonably  common  assumption  to  consider  explicitly 
faults  only  of  specified  sizes  (for  example,  all  single  faults),  and  then 
to  assume  (or  hope)  that  most  or  all  faults  of  larger  sizes  will  be 
covered  (that  is,  detected)  as  welL  This  paper  systematically 
addresses  this  aspect  of  multiple  fault  coverage  in  a  quantitative 
manner  for  combinational  networks,  wherein  only  primary  input 
fanout  is  allowed.  A  procedure  is  given  to  estimate  (or  project)  the 
multiple  fault  coverage  capability  of  a  test  set  based  on  the  known 
coverage  capability  of  that  test  set  for  subsets  of  the  multiple  faults. 
This  is  accomplished  by  means  of  a  recursive  use  of  a  detailed 
formula  which  exploits  two  fundamental  interrelationships  between 
test  sets  and  faults.  Based  upon  these  results,  it  can  be  shown  that  the 
above-mentioned  assumption  must  be  made,  in  general,  with 
discretion  as  its  validity  is  highly  network  structure/lest  set 
dependent 

Index  Terms — Consistency,  coverage,  fault  detection,  internal 
fanout-free  networks,  multiple  faults,  recursive  projection. 

I.  Introduction 

The  magnitude  of  the  number  of  multiple  faults  which  can 
potentially  occur  in  any  logic  network  of  reasonable  size  usually 
prohibits  their  individual  consideration  in  test  set  generation. 
Nevertheless,  the  ability  to  test  a  logic  network  for  multiple  faults 
with  a  certain  degree  of  confidence  is  fundamental  to  the  reliabil¬ 
ity  aspects  of  a  wide  range  of  systems  composed  of  such  logic 
networks  [9]-[12].  Thus,  in  practice,  it  is  often  found  that  a  test  set 
will  be  generated  explicitly  for  only  a  subset  of  all  possible  faults 
which  can  occur  with  the  assumption  (or  hope)  that  most  other 
faults  are  covered  as  well.  Except,  however,  for  some  reasonably 
straightforward  results,  there  has  heretofore  been  little  work  pub- 
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iished  in  the  literature  which  theoretically  analyzes  this  assump¬ 
tion  in  a  quantitative  manner.  This  note,  therefore,  addresses  this 
problem  of  analyzing  the  coverage  capability  of  test  sets  for  mul¬ 
tiple  faults  of  sizes  larger  than  that  for  which  the  test  sets  were 
explicitly  generated. 

A  particularly  prominent  aspect  of  this  problem  is  the  case  in 
which  a  complete  single  fault  test  set  is  generated  for  a  network 
and  then  used  for  the  coverage  of  both  single  and  multiple  faults 
in  the  network.  The  following  example  illustrates  the  tenuous 
nature  of  such  an  approach. 

Example  I:  Consider  the  40-input,  single-output,  nonredun- 
dant,  combinational  network,  denoted  as  N.  shown  in  Fig.  1(a). 
Let  K  =  {1,  3,  6,  8, 1 1,  13,  16.  18,  21, 23,  26,  28,  31,  33,  36,  38)  be  a 
set  of  v  =  16  lines  in  N,  and  let  T%  be  the  single  fault  detection  test 
set  (SFDTS)  consisting  of  the  56  input  vectors  shown  in  Fig.  1(b). 
Now  it  can  be  seen  upon  investigation  of  this  network  that  T,  also 
covers  every  multiple  fault  involving  any  2  and  3  of  these  16  lines 
(indeed,  this  is  expected  [2],  [4])  and  approximately  90  percent  of 
the  multiple  faults  involving  any  4,  5,  and  6  of  these  16  lines. 
However,  if  we  consider  the  multiple  faults  simultaneously  involv¬ 
ing  all  16  lines,  then  it  can  be  seen  (on  the  basis  of  coverage 
projection  results  to  be  later  developed  in  this  paper)  that  of  the 
2“  =  65536  such  multiple  faults,  only  29822  of  these  faults  are 
covered  by  T,.  That  is.  more  than  one-half  of  all  the  multiple  faults 
involving  all  16  lines  of  K  can  be  shown  to  be  undetected  by  the 
given  Ts.  (It  is  interesting  to  note,  incidentally,  that  no  two  or 
more  of  these  216  multiple  faults  are  functionally  equivalent.) 
Thus,  we  see  that  for  this  network  and  these  16  lines,  the  multiple 
fault  coverage  capability  for  the  given  T,  is  reasonable  for  multiple 
faults  involving  up  to  6  lines,  but  that  for  the  multiple  faults  of 
greater  size,  this  capability  is  quite  tenuous. 

The  above  example  is  not  an  isolated  or  rare  situation  as  the 
perspective  given  by  the  results  to  be  developed  in  this  paper  will 
indicate.  Accordingly,  we  are  well  motivated  here  to  develop  a 
formal,  systematic  approach  for  evaluating  the  multiple  fault 
coverage  capability  of  test  sets. 

Finally,  the  fact  that  we  deal  in  this  paper  exclusively  with 
networks  wherein  only  primary  input  fanout  is  allowed  should 
not  be  viewed  as  overly  restrictive.  While  it  would  be  inappro¬ 
priate  here  to  detail  the  importance  of  their  position  in  the  general 
area  of  combinational  switching  networks,  it  is  clear  that  such 
networks  have  provided  and  will  continue  to  provide  a  useful 
framework  for  the  establishment  of  general  results  and  guidelines 
in  the  field  of  fault  analysis.  Moreover,  since  the  use  of  our  results 
leads  to  somewhat  of  a  warning  regarding  the  projected  multiple 
fault  coverage  capability  of  test  sets,  these  networks  serve  us  well, 
as  it  has  been  shown  [6],  (7]  that  the  presence  of  internal  fanout 
should,  if  anything,  decrease  this  coverage  capability.  It  should  be 
noted  that  the  class  of  internal  fanout-free  networks  considered  in 
this  paper  is  significantly  more  general  than  the  class  of  tree 
networks  wherein  no  fanout  is  allowed.  In  fact,  straightforward 
methods  to  determine  complete  multiple  fault  detection  test  sets 
for  tree  networks  are  well  known  [8}-(12J. 

II.  Preliminaries 

An  n-input,  single-output  internal  fanout-free  combinational 
switching  network  in  which  primary  input  fanout  is  allowed  will 
be  denoted  in  the  following  simply  as  N.  For  reference,  each  line 
of  N  will  be  labeled  with  a  distinct  integer.  Subsets  of  the  set  of 
lines  in  N  will  then  be  denoted,  for  example,  as  /  =  (i,,  •  ,  *,}, 
J  =  {j,.  ■  or  K  =  k»).  The  n  primary  input  variables 

X|,  x2,  •••,  x,  to  N  will  be  collectively  denoted  in  general  by  an 
n-tuple  input  vector  X.  We  will  use  X,  to  signify  a  particular  input 
vector  to  the  network  wherein  each  Xj  e  Xt,  1  <  j  <  n,  is  assigned 
a  specific  binary  value  of  either  0  or  I. 
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**<  “  (<V  <V  f  E  (0.  11  implies  that  the  AO-tuple  input  vector  X,  is  such 

that  the  components  *1  the  positions  ct. arc  all /f.  and  the  components  at  all 
the  other  remaining  position*  are  fi.  For  esample,  X„  -  (0. 1.0. 0.0. 0.0. 1. 1  0  0.0 

o.  o.  o.  o.  o.  o.  o.  o.  1. 1.  o.  o. o. o.  o.  1. 1.  o.  o.  o,  o,  o,  o,  o.  o.  o. o. oy 

Fig.  1  (a)  Network  for  Esample  I.  (b)  Test  Set  for  Esample  I 

The  Boolean  function  in  terms  of  the  primary  input  variables 
realized  by  any  line,  say,  f  in  N,  will  be  denoted  by  V,.  We  will  refer 
to  V,  as  the  internal  line  variable  for  line  t.  Similarly,  we  can 
succinctly  describe  the  set  of  Boolean  functions  corresponding  to 
the  set  of  internal  line  variables  of,  say.  these!  of  lines  K  =  {k,.  k3. 
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■  ■  ■ ,  It,}  as  V*  =  JY4l,  Vjj,  •  •  • ,  YJ.  The  primary  output  function  of 
N  can,  of  course,  be  expressed  entirely  in  terms  of  the  primary 
input  variables.  However,  for  our  purposes,  it  will  be  necessary  to 
also  express  the  primary  output  function  in  terms  of  internal  line 
variables.  Accordingly,  we  let  Z[X ;  Y*]  denote  the  primary  output 
function  of  N  when  it  is  written  in  terms  of  X  and  Y*.  For  exam¬ 
ple.  if  K  =  { 1, 2, 3, 4},  the  Boolean  function  Z[X ;  Y*]  for  a  network 
N  might  be  written  as  Z[X;  Yt]  =  (x,x2  Y,  +  x3)  (x4  Y})  + 
xi*j  Y3  +  }*  +  x2x3. 

Given  any  set  K  of  p  lines  in  N,  clearly,  there  are  2'  possible 
stuck-type  fault  events  that  can  be  associated  with  these  v  lines 
wherein  each  of  the  lines  is  either  stuck -at-zero  or  stuck-at-one. 
This  set  of  2"  such  fault  events  on  the  set  K  of  lines  in  N  will  be 
referred  to  as  the  fault  complex  on  K  and  will  be  denoted  as 
F»  =  {a*,  a*,  •  •  • ,  a*"}  where  each  a*  e  Fk  is  described  by  a  binary 
c-tuple.  In  the  presence  of  any  fault  event  a1*  e  F*  in  N,  the  result¬ 
ing  faulty  primary  output  function  of  N  can  be  written  as  Z[X ; 
a1*].  Clearly,  this  is  obtained  from  Z[X ;  Yt]  by  simply  replacing 
the  variables  of  Y*  in  the  function  by  their  corresponding  binary 
values  from  ait.  In  the  following,  whenever  there  are  no  chances  of 
ambiguity,  aK  will  be  used  in  place  of  a'r  to  denote  a  general 
element  of  F*. 

It  should  be  noted  here  that  because  we  are  considering  only 
internal  fanout-free  networks  where  primary  input  fanout  is 
allowed,  it  follows  that  there  are  dominating  interrelationships 
among  the  faults  on  the  lines  of  N  which  indicate  that  not  all 
subsets  of  lines  of  N  are  really  of  interest.  Accordingly,  we  will 
make  the  following  two  assumptions  on  the  members  of  sets  of 
lines  to  be  considered  in  this  paper:  first,  we  will  assume  that  any 
set  of  lines  we  consider,  such  as  the  set  K  discussed  above,  will  not 
contain  primary  input  fanout  stem  lines;  second,  we  will  assume 
that  any  set  of  lines  we  consider  will  contain  no  two  or  more  lines 
which  are  on  the  same  path  from  a  primary  input  to  the  primary 
output.  It  should  be  clear  that  these  assumptions  in  no  way  re¬ 
strict  the  consideration  of  fault  events  of  interest.  It  should  also  be 
noted  that  because  of  these  assumptions,  it  follows  that  the  pri¬ 
mary  output  function  Z[X ;  Y*]  for  any  set  K  of  lines  of  interest  in 
N  can  be  expressed  such  that  each  internal  line  variable  for  each 
t  e  K  occurs  exactly  once  in  the  function,  either  as  Y,  or  as  Y,. 
Finally,  it  should  be  clear  that  because  of  this  latter  property  of 
Z[X ;  Y*]  regarding  the  single  occurrence  of  each  Y,  e  Y*,  for  any 
set  K  of  lines  in  N,  we  have  that  Z[X ;  Y*]  can  be  written  as  a 
function  of  Boolean  subfunctions  such  that  none  of  the  Y,  var¬ 
iables  appears  in  more  than  one  of  the  subfunctions. 

More  particularly,  let  K  be  a  set  of  v  >  2  lines  in  N,  and  let  / 
and  J  be  disjoint,  nonempty  subsets  of  K  such  that  /  u  J  =  K. 
Consider  then  the  following. 

Definition  2.1 :  The  sets  /  and  J  are  said  to  form  a  2-partition  of 
K  if  there  exist  two  Boolean  subfunctions,  say.  H[X ;  Y>]  and 
M[X ;  Yj],  such  that  the  primary  output  function  of  N  can  be 
written  as  Z[X;  Y*]  =  H[X ;  Y,j  «  M[X;  Y,]  +  0[X]  where 
corresponds  to  either  “+"  or 

Example:  If  Z[X;  Y»]  =  (x,x2  Y|  +  x3)  •  (x4  Y2)  +  XiX3  Y}  + 
Y4  +  x2  Xj,  then  /  =  {1,  2}  and  J  =  {3,  4}  form  a  2-partition  of 
K  =  {1,  2.  3,  4}.  wherein  H[X ;  Y/j  =  (xi  x2  Y,  +  x3)  •  (x«  Y2). 
M[X;  Yj  =  x,x3  Y3  +  Y4,  and  0[X]  =  x2x5. 

Because  of  the  above-mentioned  assumptions  on  the  members 
of  sets  of  lines  considered  in  this  paper,  it  should  be  clear  that 
given  any  set  K  of  v  ^  2  lines  in  N.  there  will  always  exist  at  least 
one  2-partition  of  K.  This  concept  of  a  2-partition  will  play  an 
important  role  in  our  coverage  projection  theory.  Thus,  it  will  be 
convenient  in  the  following  to  refer  to  each  of  the  sets  I  and  J  as 
component  sets  of  K.  and  to  refer  to  each  of  the  fault  complexes  F< 
and  Fj  as  component  fault  complexes  of  F*.  Note  that  if  the 
component  sets  /,  say,  /  =  {i(,  i2.  ■  •  • ,  /,),  and  J,  say.  J  =  {ji.Ji.  '  • 


j,).  are  considered  as  ordered  vectors,  such  as  (i|,  ip)  and  (j ,, 
■  • ,  j,),  respectively,  and  if  the  set  K  is  accordingly  ordered  as  (i3. 
<2.  •••.  ip.ji.ji.  '.)«).  then  the  fault  complex  F*  will  be  isomor¬ 
phic  to  the  Cartesian  product  F/  x  Fj.  Throughout  this  paper,  the 
above-mentioned  ordering  in  sets  1,  J,  and  K  will  be  assumed  for 
the  sake  of  notational  convenience.  Also,  we  will  write  that 
F*  =  F/  *  Fj  where  it  will  be  understood  that  elements  of  F*  are 
u-tuples. 

Obviously,  we  could  also  define  r-partitions  for  r  >  2.  Indeed, 
this  has  been  done  in  [S].  But  the  accompanying  notational  com¬ 
plexity  outweighs  the  advantage  of  the  more  general  statements  of 
the  results  to  follow.  Moreover,  these  developments  using  only 
2-partitions  are  a  sufficient  basis  for  such  generalizations  [5], 

III.  Actual  and  Projected  Coverages 

In  this  paper  we  will  use  T(a4)  to  denote  the  standard  complete 
test  set  for  the  fault  event  a*  e  F* .  This  test  set  consists,  of  course, 
of  all  those  input  vectors  to  N  which  can  detect  a4,  and  can  be 
detailed,  for  example,  as  T(a*)  =  {X(  |  Z[X(;  YJ  ©  Z[X a«]  =  1}. 
Given  a  set  of  input  vectors,  denoted  as  T,  for  N,  we  will  be 
interested  in  the  capability  of  T  to  cover  (that  is,  detect)  fault 
events  in  F*.  Accordingly,  we  will  let  ACK(T )  £  F*  denote  the  set 
of  fault  events  in  F*  which  are  actually  covered  by  T.  In  other 
words,  a*  e  /4Cr(T)  implies  that  T  n  r(a*)  0.  We  will  in  the 

following  refer  to  the  actual  coverage  of  T  for  F*  as  being  com¬ 
plete  when  ACk(T)  —  F*.  It  should  be  kept  in  mind,  however,  that 
ACt(T)  is  usually  unknown,  and  it  is  the  goal  of  this  note  to 
consider  means  of  determining  or  approximating  ACk{T). 

Accordingly,  in  addition  to  actual  coverage,  we  will  be  very 
much  concerned  in  the  following  with  what  we  will  refer  to  as  the 
projected  coverage  of  a  fault  complex  by  a  lest  set.  This  projected 
coverage  will  be  an  estimate  of  the  actual  coverage.  Hence,  rela¬ 
tive  to  the  above,  we  will  let  PCk(T )  E  F*  denote  the  set  of  fault 
events  in  F*  which  are  estimated  to  be  covered  by  T  where 
PC k(T)  will  always  be  less  than  or  equal  to  4C*(T).  The  process 
of  calculating  FC*(T)  will  be  referred  to  as  test  set  coverage 
projection  and  can  generically  be  described  as  follows.  In  the 
network  N,  and  for  the  set  K  of  t>  lines  in  N,  let  1  and  J  form  a 
2-partition  of  K.  Let  T  now  be  a  set  of  input  vectors  whose  (either 
actual  or  projected)  coverages  for  the  component  fault  complexes 
F;  and  Fj  of  F*  are  known.  Test  set  coverage  projection  can  then 
be  described  as  the  process  of  projecting  the  coverage  of  T  to  Fa 
based  upon  our  knowledge  of  the  coverage  of  T  for  F(  and  F j. 
Accordingly,  we  will  refer  to  the  component  fault  complexes  Fj 
and  Fj  as  the  basis  of  projection  and  to  F*  as  the  target  fault 
complex.  In  Section  V,  we  will  give  a  projection  formula  which 
can  be  used  to  project  the  coverage  of  a  test  set  to  a  target  fault 
complex. 

It  should  be  noted  here  that  the  resulting  projected  coverage  by 
T  of  the  target  fault  complex  F*  would  ideally  be  equal  to  the 
unknown  actual  coverage  (that  is,  FCa(T)  =  ACK(T)).  If,  indeed, 
this  is  so,  then  in  the  following  we  will  say  that  the  projected 
coverage  of  T  for  Fa  is  exact.  However,  whether  or  not  the 
projected  coverage  is  exact  clearly  depends  on  a  number  of  fac¬ 
tors.  Certainly  one  of  the  leading  factors  is  whether  our  knowl¬ 
edge  of  the  coverage  of  T  for  the  basis  of  projection  is.  indeed, 
the  actual  coverage.  But  even  if  this  is  so,  other  factors  such  as 
the  mechanics  of  the  test  set  coverage  projection  and  the  relation¬ 
ships  which  exist  among  T  and  the  fault  events  of  F,  and  Fj 
obviously  also  enter  into  this  issue.  This  latter  factor  is  the  topic  of 
the  next  section. 

IV.  Consistency  and  Testing  Direction 

In  this  section  we  will  define  two  relationships  between  faults 
and  test  sets  which  will  be  seen  to  be  central  to  test  set  coverage 
projection  from  a  basis  to  a  target. 
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Definition  4.1 :  A  fault  event  ot,  e  F,  is  said  to  be  a  /(-consistent 
fault  event  for  an  input  vector  X,  relative  to  the  subfunction  H[X ; 
Y,]  if  H[X,:  Y,]  =  0  implies  H[X„  a,]  =  0  where  0  e  {0,  i}. 

Accordingly,  given  any  H[X;  Y|]  and  an  input  vector  X,  to  N, 
we  can  determine  the  set  of  fault  events 

Ff(X,)  =  {«,|HtXi;  Y,]  -  tf[X,;  a,]  =  0). 

In  other  words,  Ff(X/)  £  F,  is  the  set  of  all  the  ^-consistent  fault 
events  of  F,  for  X(  relative  to  W[X;  Yf].  Along  similar  lines,  we 
now  give  the  following. 

Definition  4.2:  Given  a  set  of  input  vectors  T  to  N  and  the  fault 
complex  F,,  a  fault  event  a,  e  F,  is  said  to  be  a  (0  -»  1  /-tested  fault 
event  for  T  if  for  some  input  vector  X,  e  T,  we  have  that  Z[X(; 
Y,]  =  0  and  Z(X,;  a,J  =  1. 

In  other  words,  if  a,  is  a  (0  ->  1  )-tested  fault  event  for  T,  then 
there  must  exist  at  least  one  input  vector  X,  e  T  which  makes  the 
fault-free  output  of  N  a  logical  zero  and  the  faulty  output  of  N 
when  the  fault  event  a,  is  present  a  logical  one.  In  the  following, 
such  input  vectors  will  be  said  to  (0  -*  latest  a,.  The  above  can 
similarly  be  stated  for  (1  -*  Obtesting. 

Using  Definition  42,  we  now  supplement  the  notation  in¬ 
troduced  in  Section  III  by  letting  AC?'(T)  and  AC'°(T),  respec¬ 
tively,  denote  the  set  of  fault  events  of  F,  which  are  actually 
(0  -♦  I  )-tested  and  (1  -» 0)-tested  fault  events  for  T.  Clearly, 
ACy(T)v  AC)°(T)=  AC,(T).  Similarly,  we  let  PC?{(T)  and 
PC'°(T).  respectively,  denote  the  set  of  fault  events  of  F,  which 
are  projected  to  be  (0  -» 1 /-tested  and  (1  -»0)-tested  fault  events 
for  T.  Clearly,  PC?'(T)  u  PCl°(T)  =  PC,{T). 

Finally,  given  a  fault  event,  say.  a,  e  AC,{T),  or  a >  e  PCj{T),  we 
will  use  (T(a,)  n  T)10  to  refer  to  the  set  of  all  those  input  vectors 
in  T,  each  of  which  (1  —  0)-tests  a,.  (T( aj)  n  T)01  can  be  in¬ 
terpreted  analogously. 

V.  Projection  Formula 

In  the  last  section,  we  defined  two  relationships,  namely,  con¬ 
sistency  and  testing  direction,  between  fault  events  of  component 
fault  complexes  and  test  sets.  Based  on  these  two  relationships,  we 
will  now  present  a  formula  for  obtaining  the  projected  coverage  of 
a  test  set  for  any  set  of  lines  in  N.  We  will  then  prove  a  fundamen¬ 
tal  theorem  which  relates  this  projected  coverage  to  the  actual 
coverage. 

Projection  Formula:  Given 

1)  a  set  K  of  lines  in  N  and  a  2-partition  /  and  J  of  K;  and 

2)  a  test  set  T  which  is  known  to  cover  the  set  PC,(T)  of  fault 
events  of  Fj  and  the  set  PCj(T)  of  fault  events  of  Fj\ 

then  the  projected  set  PCK(T)  of  the  fault  events  of  F*  covered 
by  T  is  given  as  follows. 

Part  1 .  If  Z[X ;  Yk]  =  (H[X ;  Y,]  +  AY[X ;  Y,])  +  0[X],  then 
PCa(T)  =  PCSt'(T)  u  PCl°(T) 

where 

PCV(T)=(PCV(T)  x  Fj)  v  (F,  x  PCJMT)) 
and 

PCi°(r)  =  Sj °(T)  u  SJ°(T) 

where 

Si°(T)  =  U  (  U  ({“■}  * 

\  X(*  (t  > 

and 

sj°(T)  =  u  (  U  miMM) 

•,fec,i»<r»  Vx,«<r  nri«,i)n 


Part  2:  If  Z|X;  Ya]  -  (H[X;  Y,]  Af[X;  YJ)  +  0{X].  then  the 
projection  formula  is  the  dual  of  Part  l.1 

Although  notationally  complex  in  structure,  the  Projection 
Formula  will  be  seen  to  be  quite  useful  in  the  next  section. 
Moreover,  a  rational  for  the  structure  of  the  Projection  Formula 
(and  its  associated  complexity)  can  be  provided  for  the  reader  by 
taking  the  general  perspective  of  path  sensitization  [7].  Note  that 
the  equation  for  PC$'(T)  suggests  that:  a)  for  every  at,  6  PCf'(T). 
we  will  have  that  every  a*  =  (a/t  etj),  where  a  j  e  FJt  will  also  be 
detected  by  T;  and  b)  for  every  otj  e  PCjl(T),  we  will  have  that 
every  a*  =  (at/, a,),  where  a,  e  F,  will  also  be  tested  by  T.  With  this 
in  mind,  if  we  refer  now  to  the  network  of  Fig.  2,  which  is  one 
possible  realization  of  the  given  Z[X ;  Y*J,  it  is  seen  that  because  of 
the  or  gate  at  the  output,  if  a  fault,  say,  a,  in  subnetwork  N,  was 
(0  l)-tested  by  means  of  sensitizing  line  1,  then  no  fault  in  Nt 
could  desensitize  this  line  and  mask  fault  at.  Similarly,- faults  in 
subnetwork  N,  cannot  mask  (0  -»  l)-tested  faults  of  N2-  The 
above,  then,  gives  a  rational  for  the  structure  of  the  equation  for 
PCi‘(T)  in  the  Projection  Formula. 

Similarly,  consider  the  equation  for,  say,  S}°(T)  relative  to  the 
network  of  Fig.  2.  Assume  that  a,  is  a  fault  in  N,  which  is 
(1  -» Obtested  by  some  X,  in  T.  In  other  words,  for  this  Xf.  line  1 
has  the  logical  value  of  1  when  aj  is  not  present  and  will  be  0 
otherwise,  and  line  2  will  be  0  in  both  instances.  Now,  if  a  fault, 
say,  ttj  occurs  in  N2  such  that  the  value  on  line  2  remains  at  0  for 
thisX,-,  then,  clearly,  Xf  will  also(l  -*  0)-test  the  fault  at  =  (a,,aj). 
From  Definition  4.1,  such  an  Fj  must  belong  to  FS(X,).  Hence,  all 
the  faults  in  {a,}  x  (F5(X/)}  will  be  detected  by  X,.  Moreover,  the 
above  would  hold  true  for  every  X,-e  T  which  (1  -*0)-tests  a,. 
Hence,  the  following  set  of  faults  will  be  (1  -» 0)-tested  by  T: 

(  U  (M  x  F$(X.))i 

\X,e(T  / 

Calculating  such  sets  for  all  a,  e  PC',  °(T)  and  unioning  them  pro¬ 
vides  the  rational  for  the  equation  for  S]°(T). 

It  is  important  to  note  that  the  Projection  Formula  has  an 
inherent,  recursive  nature  in  the  sense  that  the  projected  coverage 
PCk(T)  obtained  by  its  use  can  then  be  used  for  determining  the 
projected  coverage  of  T  to  other  fault  complexes  in  N  of  which  F* 
is  a  component  fault  complex.  It  should  also  be  pointed  out  that 
by  virtue  of  this  recursive  nature,  the  Projection  Formula  can  be 
stated  more  generally  for  r-partitions.  r  >  2  [5]. 

The  following  theorem  is  a  crucial  justification  of  the  Projection 
Formula. 

Theorem  5.1:  For  the  Projection  Formula, 

PCa(T)  c  aCa(T). 

Proof:  We  will  prove  this  theorem  only  for  the  case  in  which 
PCk(T )  is  obtained  by  using  Part  1  of  the  Projection  Formula, 
that  is.  for  the  case  where  Z[X ;  Y*]  =  (W[X ;  Y(]  +  AY [X ;  Yj])  + 
0{X].  Note  that  since  the  equations  in  Part  1  and  Part  2  of  the 
Projection  Formula  are  duals,  a  proof  of  the  theorem  correspond¬ 
ing  to  Part  2  of  the  Projection  Formula  follows  similarly.  The 
following  preliminaries  will  facilitate  the  proof. 

Let  «a  be  a  fault  event  of  F*.  We  know  that  a*  e  ACK(T )  if  and 
only  T( otK)  r,  T  f  <f>.  Hence,  to  show  that  for  every  a*  6  PC*(T). 
T(«a)  pi  T  4  4>,  we  will  study  the  composition  and  properties  of 
T(  ax). 

Now,  there  are  many  techniques  by  which  T(a»)  can  be 


1  {Note  that  tin «  0{x)  does  not  explicitly  pity  *ny  role  in  the  Projection  Formula, 
the  absence  of  exact  duality  in  Z[X ;  Yc]  in  Parts  l  and  2  is  not  of  importance  in  our 
considerations.) 
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detailed,  but  an  algebraic  or  functional  approach  which  will  be 
most  useful  in  our  considerations  is  the  Boolean  difference 
method  wherein 

7fc)-{Xi|ZlX.;  y*]©Z[X,;a,]=  1}. 

Since  in  Part  1  of  the  Projection  Formula  we  have  that  Z[X\ 
y*]  =  (H[X ;  y,]  +  M[X ;  y,])  +  0[X]  and  a*  =  (a,,  a j)  for  some 
«/  €  F|  and  aj  e  Fj,  we  can  also  express  T( a*)  as  follows: 

T(«»)  =  [X,\ ((H[Xi;  y,]  +  M[X,;  Yj ])  +  0[X,]) 

©  ((H(X,;  «,]  +  M[X, ;  SjJ)  +  0[X,])  =  lj.  (1) 

To  show  that  T(aic)  n  T  #  a*  6  PC^T),  we  must  determine 
T(a,)  and  T( aj),  since  out  information  about  T  is  limited  only  to 
its  coverage  of  the  fault  events  F ,  and  Fj.  As  above,  then,  we  can 
obtain  T(a()  and  T(a,)  as  follows: 

T(a,)  =  {X,|Z[X,;  y,]  ©  Z[Xt;  a,]  =  1} 

=  Mm**;  y/1  +  M[x,\  y,])  +  0[XJ) 

©  ((//[X.ot,]  +  M[X(;  Yj]  +  O(Xj)  =  1}  (2) 

and 

T(a,)  =  {X,|Z[X(;  Yj]  +  Z[X,;  a,]  =  1} 

=  {Xi|((//[X„-  y(]  +  M[X,;  x,])  +  ofxj) 

©  ((H[X,;  y,]  +  M[Xr.  a])  +  0{X,])  =  1}.  (3) 

With  these  details  regarding  the  composition  of  T(a*),  T(a,j, 
and  T(aj)  in  mind,  we  can  now  proceed  to  the  formalities  of  the 
proof.  To  show  that  PCK{T)  obtained  by  using  Part  1  of  the 
Projection  Formula  is  contained  in  or  is  equal  to  ACk{T),  we  will 
only  consider  the  PC'k°(T)  subset  of  PCK(T).  The  theorem  can 
similarly  be  proven  for  the  PC*‘(T)  subset  of  PCK{T). 

Let  a,  be  a  fault  event  of  F ,  in  PC)°(T).  Furthermore,  let  X,  be 
an  input  vector  in  T  which  (I  — •  0)-tests  a(,  that  is, 
X,  e  (T  n  T(a,))10.  By  Definition  4.2  for  this  X,  we  have  that 
Z[Xf;  Vj]  =  1  and  Z[X,-;  a,]  =  0.  But  more  particularly  observe 
that  from  (2)  this  implies  that  M[X,;  yj  =  0[X(]  =  0,  H[X(; 
FJ  =  1,  and  H[Xh  a,]  =  0.  Noting  that  we  have  M[Xf;  Yj]  =  0,  we 
now  consider  F5(X,),  the  fault  events  of  Fj  which  are  0-consistent 
relative  to  M[X ;  Jj]  for  X,.  Letting  aj  e  Fj(X(),  we  now  direct  our 
concern  to  the  fault  event  a*  =  (a,,  aj)  of  F*.  Since,  by  the 
Definition  4.1.  we  have  that  AffA'i;  Yj]  =  M[X,\  a,]  =  0,  it  should 
be  clear  that  together  with  the  above  observations  regarding  (2), 
we  have  from  (1)  that  XieT(<xK),  that  is,  T(3k)  r\  T  =£  tj>. 
Furthermore,  since  Z[X ;  V*]  =  1  and  Z[X ;  a*]  =  0,  we  have  also 
that  a*  e  ACl°(T). 

The  above  is  evidently  true  for  all  fault  events  of  F*  in  {a,}  x 
FS(Xj)  corresponding  to  every  X,f(T  n  T(a,))10.  In  other 
words, 

U  da'}  x  F/(X,))S  ACl°(T).  (4) 

X,t<T  "T<«,| |l® 

Moreover,  (4)  is  clearly  true  for  every  a,  6  PCj°(T).  Thus,  we 
have  that 

U  (  U  (!*/}  X  F?(X,»)  S  4Ci°(T).  (5) 

«,€PC,io(r)\X|«(T  ^T(>,l)l0  / 

For  convenience,  we  refer  to  the  left-side  set  of  (5)  as  Sj°(T). 

We  can  similarly  show  that 

u  (  u  (f?(x,)  x  {«,}))  e /icnn  (6) 

«, .  PC,iO(TI  lliilTi  T<«,|>10  / 
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As  before,  for  convenience,  we  refer  to  the  left-side  set  of  (6)  as 
S}°(T). 

Combining  (S)  and  (6)  yields 

PCi°(T)  =  S}°(T)  u  Sj°(T)  c  /lC]j0(r).  Q.E.D. 

In  other  words,  we  have  shown  in  Theorem  5.1  that  the 
projected  coverage  PC*(T)  obtained  by  using  the  Projection  For¬ 
mula  is  a  lower  bound  on  the  actual  coverage  ACt(T).  This  means 
that  the  Projection  Formula  gives  at  worst  an  inexact  coverage. 

Consider  now  the  particular  case,  wherein  T  is  a  complete 
single  fault  detection  test  set.  In  such  a  case,  PC%(T)  can  be  ob¬ 
tained  by  a  recursive  use  of  the  Projection  Formula.  That  is,  if 
either  PC,(T)  or  PCj(T)  or  both  are  not  available,  then  the 
Projection  Formula  could  first  be  used  to  calculate  those  unavail¬ 
able  projection  sets.  In  fact,  since  we  arc  dealing  with  a  complete 
single  fault  detection  test  set,  such  a  recursive  use  of  the  formula 
would  continue  until  the  component  complexes  being  considered 
are  associated  with  single  lines  of  N.  It  might  now  seem  that  for 
large  K,  such  recursive  use  of  the  Projection  Formula  could  lead 
to  a  poor  PCk(T)  as  an  estimate  to  AC*(T),  but  such  is  not  always 
the  case,  as  is  shown  in  the  next  section. 

VI.  Actual  and  Projected  Coverages  Revisited 

In  this  section  we  will  consider  some  cases  to  illustrate  situa¬ 
tions  in  which  the  projected  coverage  can  be  shown  to  be  com¬ 
plete  or  exact  relative  to  the  actual  coverage  of  the  target  fault 
complex. 

We  begin  by  giving  the  following  theorem  regarding  com¬ 
pleteness  in  the  sense  that  situations  can  be  specified  wherein  the 
projected  coverage  given  by  the  Projection  Formula  will  be  the 
entire  target  fault  complex,  that  is,  PCK(T)  =  F*.  Doing  so  will 
not  only  give  further  insight  into  the  Projection  Formula,  but  will 
also  reinforce/extend  the  principal  results  available  in  the  litera¬ 
ture  [l]-[4]  regarding  complete  multiple  fault  coverage  situations 
for  single  fault  detection  test  sets. 

Theorem  6.1: 

a)  Given  any  set  K  of  lines  and  a  complete  single  fault  detec¬ 
tion  test  set  7j  for  a  cascaded  two-level  network,  the  projected 
coverage  FC*(7j)  obtained  from  the  Projection  Formula  equals 
F*.  (See  [1].) 

b)  Given  any  set  K  of  lines  and  a  complete  single  fault  detec¬ 
tion  test  set  T,  for  an  internal  fanout-free  network  N,  if  the  lines  of 
K  are  inputs  to  three  or  fewer  gates,  the  projected  coverage 
PCn(T,)  obtained  from  the  Projection  Formula  equals  F*.  (See  [2] 
and  [3].) 

c)  Given  any  set  K  of  lines  and  a  complete  single  fault  detection 
test  set  T,  for  an  internal  fanout-free  network  N,  if  Z[X; 
Fa]  =  H[X ;  y,]*M[X;  Y,]  +  0[X],  |/|  =  |K|-1,  |J|  =  1, 
and  if  T,  covers  all  the  fault  events  of  F,,  then  the  projected 
coverage  PCn(T,)  obtained  from  the  Projection  Formula  equals 
F*.  (See  [4].) 

Proof:  Given  in  [5],  and  available  from  the  authors. 

We  will  now  consider  some  of  those  situations  wherein  the 
Projection  Formula  is  to  be  used,  but  the  conditions  of  Theorem 
6.1  are  not  satisfied.  Our  concern  now  will  be  exact  projected 
coverage.  We  start  with  the  following. 

Definition  6.1 :  Given  a  set  K  of  lines,  a  2-partition  /  and  J  of  K, 
and  a  set  T  of  input  vectors  for  N.  If  every  X,  e  T  is  such  that  it 
covers  at  least  one  fault  event  of  F,  u  Fj,  or  if  every  X,  e  T  which 
does  not  cover  any  fault  event  of  F(uf;  is  such  that  it  docs  not 
cover  any  fault  event  of  F*  as  well,  then  T  is  said  to  satisfy  the 
exactness  requirement  for  F»  and  Fj  relative  to  F,. 

We  will  now  prove  a  theorem  which  establishes  a  relation  bc- 
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tween  PCAT)  and  AC*(T)  when  T  satisfies  the  exactness 
requirement. 

Theorem  62:  For  the  framework  of  the  Projection  Formula,  if 
PC  AT)  =  AC  AT).  PC  AT)  =  AC  AT),  and  T  satisfies  the 
exactness  requirement  for  F/  and  F,  relative  to  F*.  then  PCAT) 
obtained  from  the  Projection  Formula  equals  AC  AT). 

Proof:  We  will  prove  this  theorem  (by  contradiction)  only  for 
the  case  in  which  PCAT)  is  obtained  by  using  Part  1  of  the 
Projection  Formula  as  the  other  case  follows  similarly.  We  will 
also  take  advantage  of  the  preliminaries  established  in  the  proof  of 
Theorem  5.1. 

Thus,  assume  that  in  F*  there  exists  a  fault  event 

a*  =  (a/,  a,)  €  AC  AT)  =  ACaA(T)  u  ACl°(T) 


where 

*«  6  ACV(T). 

but  a*  4  PCaA(T) 

(7) 

or  where 

aKeACi°(T). 

but  aK  4  PCi°(T). 

(8) 

We  will  now  consider  the  implications  of  (8)  and  show  that  it 
leads  to  a  contradiction.  It  can  similarly  be  shown  that  (7)  would 
also  lead  to  a  contradiction. 

Suppose  now  that  (8)  is  true.  Then  since  a*  6  /4Ci°(T),  consider 
any  X,-  e  (T  n  T( a*))10.  For  this  X,  we  have  that  Z(X(;  V*]  =  1 
and  Z[X(;  a*]  =  0.  But  more  particularly  observe  from  (I)  that 
this  implies  that 

H[Xi;  Y,]  +  M[X(;  Yj ]  =  1 

and 

//[X„  a,]  =  MIX,.  =  0{XJ  =  0. 

Along  these  lines,  suppose  that  W[X,;  T,]  =  1  and  M[X,; 
K,]  =  0.  Then,  clearly,  Z[X.;  T,]  =  1  and  Z[X,;  a,]  =  0,  so 
a,  e  ACj°(T).  Furthermore,  since  we  have  by  hypothesis  that 
PC  AT)  =  AC  AT),  then  a ,  e  PC}°(T).  However,  since  M[X,; 
Yj]=  M[X a j]  =  0,  then  a,  e  F°(X|).  But  this  implies  that 
a*  e  ({at/}  x  Fj(X,))  S  PCl°(T),  which  contradicts  (8).  Similarly, 
if  we  assume  that  H[X Vi]  =  0  and  M[X,;  Yj)=  1,  we  are  also  led 
to  a  contradiction  of  (8).  Finally,  the  only  remaining  alternative  is 
W[Xj;  Vj]  =  M[Xj;  Vj]  =  1.  But  from  (2)  and  (3)  this  implies  that 
X,  i  T( a,)  and  X,  (  T(af)  for  any  a,  €  F,  or  <x,  e  Fj,  respectively. 
This,  of  course,  contradicts  the  exactness  requirement  on  T  which 
states  that  every  X,  e  T  covers  at  least  one  fault  event  in  Ft  u  Fj, 
or  if  Xj  does  not  cover  any  fault  event  of  F,  u  F j.  then  X,  does 
not  cover  any  fault  of  F*.  We  therefore  have  that 

ACi°(T)  =  PCi°(T).  Q.E.D. 

It  is  clear  from  the  proof  of  Theorem  6.2  that  if  PCAT )  = 
AC  AT)  and  PC  AT)  =  AC  AT),  then  the  projected  coverage 
PCAT)  differs  from  the  actual  coverage  ACK(T)  only  by  those 
fault  events  of  F*  which  are  covered  by  the  input  vectors  in  7,  but 
which  do  not  cover  any  fault  event  of  F;  or  Fj. 

The  result  stated  in  Theorem  6.2  is  useful  for  determining  the 
actual  coverage  of  a  complete  single  fault  detection  test  set  for 
multiple  faults  by  means  of  the  Projection  Formula.  More  particu¬ 
larly,  given  a  set  X  of  lines  and  a  complete  single  fault  detection 
test  set  T,  for  N.  suppose  that  we  use  the  Projection  Formula  to 
determine  the  projected  coverage  PCAT,).  Suppose  further  that 
for  every  execution  of  the  Projection  Formula  algorithm,  we  have 


that  the  actual  coverage  of  T,  for  the  projection  basis  is  known, 
and  the  exactness  requirement  on  T,  for  the  projection  basis  rela¬ 
tive  to  the  target  complex  is  satisfied.  Then,  clearly,  by  Theorem 
6.2  it  can  be  concluded  that  PC  AT,)  so  obtained  would  be  the 
actual  coverage  /4C*(T,). 

Example  2:  Consider  the  network  of  Fig.  1(a)  and  the  set  X  of 
16  lines  mentioned  in  Example  1.  By  using  the  Projection  For¬ 
mula  with  T  as  the  set  of  input  vectors  given  in  Fig.  1(b),  it  can  be 
shown  that  |PC*(7j)j  =  29822.  Furthermore,  it  can  be  seen  that 
the  conditions  of  Theorem  6.2  are,  indeed,  satisfied  for  every 
execution  of  the  Projection  Formula,  and  therefore  we  have  that 
PCAT)  =  AC  AT). 

When  we  consider  those  situations  in  which  neither  the  condi¬ 
tions  of  Theorem  6.1  nor  the  exactness  requirement  is  satisfied,  we 
simply  know  that  PCAT)  is  a  lower  bound  to  AC  AT)  First  of  all 
it  might  be  that  this  lower  bound  is  quite  close  to  complete  cover¬ 
age  and,  therefore,  is  acceptable  in  some  applications.  But  more 
importantly,  when  we  have  the  situation  where  the  exactness  re¬ 
quirement  is  not  satisfied  and  when  the  projected  coverage  is  too 
low  to  be  acceptable  with  its  uncertainty,  then  there  is  no  recourse 
but  to  utilize  some  classical  means  of  determining  the  actual 
coverage  of  T  for  F». 

VII.  Conclusion 

In  conclusion,  our  experience  in  using  the  Projection  Formula 
on  various  internal  fanout-free  networks  for  various  fault  com¬ 
plexes  [5],  [6]  has  shown  that  the  common  assumption  that  the 
coverage  capability  of  test  sets  is  usually  satisfactory  for  multiple 
faults  on  sizes  greater  than  that  for  which  test  sets  were  explicitly 
generated  must  be  used  with  discretion,  as  it  has  been  so  seen  that 
even  for  such  networks,  the  validity  of  this  assumption  is  highly 
network  structure/lest  set  dependent.  However,  it  seems  to  be 
possible  to  characterize  the  most  difficult  fault  complexes  for 
single  fault  detection  test  sets  to  cover  by  generic  models  [6].  This 
suggests  that  if  such  generic  models  could  be  avoided  in  the  design 
structure  of  networks,  the  overall  coverage  capability  would  be 
enhanced.  Indeed,  in  an  extreme  sense,  this  is  the  approach  used  in 
the  results/networks  reported  in  literature  for  which  any  complete 
single  fault  detection  test  sets  cover  all  multiple  faults  [1], 
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Multiple  Fault  Detection  in 
Programmable  Logic  Arrays 

VINOD  K.  AGARWAL 

Abstract — The  increasing  recognition  or  PLA's  as  efficient  anti  viable 
modules  for  such  purposes  as  microprogramming  and  design  of  sequential 
controllers  has  led  to  a  growing  interest  in  the  development  of  optimum  fault 
detection  test  sets  for  these  modules.  It  is  now  well  known  that  a  fault  type  which 
is  unique  to  PLA's  is  the  class  of  contact  faults.  A  single  contact  fault  is  the 
spurious  presence  or  ahsenre  of  a  contact  between  a  row  and  a  column  of  a  PLA. 
We  consider  in  this  paper  the  problem  of  determining  the  capability  of  complete 
single  contact  fault  test  sets  to  coter  multiple  contact  faults  of  PLA’s.  Our 
approach  consists  of  developing  a  model  of  PLA’s  which  allows  one  to  represent 
a  contact  fault  in  a  PLA  as  a  stuck-at  fault  in  the  model  of  the  PLA.  Using  this 
model,  it  is  shown  that  more  than  98  percent  of  all  multiple  contact  faults  of 
size  8  and  less  are  inherently  covered  by  every  complete  single  contact  fault  lest 
set  in  a  PLA.  Applications  of  this  model  to  sluck-at  fault  diagnosis  are  also 
discussed. 

Index  Terms — Contact  faults,  masking,  multiple  fault  detection,  PLA  fault 
detection,  PLA  modeling,  programmable  logic  arrays,  single  fault  coverage. 

1.  Introduction 

A  programmable  logic  array  (PLA)  is  normally  used  to  implement 
mullioutput  combinational  logic  by  programming  “blank"  arrays  of 
an  AND-OR  (or  NOR-NOR)  matrix  1 1  ].  The  economy  and  flexibility 
accrued  by  using  PLA's  have  moreover  led  to  their  growing  usage  in 
such  areas  as  microprogramming,  sequential  controllers,  function 
generators,  and  code  conversion.  The  availability  of  Held  program¬ 
mable  logic  arrays  has  furthermore  provided  the  logic  designer  ad¬ 
ditional  freedom  to  program  an  array  on  the  site  by  blowing  fusible 
links  within  the  array.  The  reliability  of  these  devices  is  therefore  a 
matter  of  considerable  importance  [2J-|5J. 

Accordingly,  this  paper  is  concerned  with  the  problem  of  fault 
detection  (that  is,  fault  testing)  in  PLA’s.  Since,  conceptually,  a  PLA 
is  simply  a  collection  of  many  two-level  and-OR  networks,  it  may 
be  argued  that  a  PLA  can  be  tested  by  using  the  well-known  tech¬ 
niques  [6)  for  testing  AND-OR  networks.  However  the  memory-like 
structure  of  a  PLA  not  only  leads  to  the  usual  stuck-at,  bridge-type. 
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and  shorled-diode  faults  found  in  most  combinational  networks,  but 
it  also  results  in  a  unique  type  of  contact  faults  not  found  in  such 
networks.  A  contact  fault  (called  a  crosspoint  fault  in  )2)  and  |3)  and 
shrinkage  and  appearance  faults  in  (4j)  in  a  PLA  is  caused  by  the 
spurious  presence  or  absence  of  a  contact  (that  is,  a  fusible  link)  be¬ 
tween  a  row  and  a  column  of  the  PLA.  In  a  recent  paper  [2]  by  Os- 
tapko  and  Hong,  a  scheme  to  generate  a  test  set  which  detects  the 
presence  of  every  single  contact  fault  in  a  PLA  is  presented.  More 
importantly,  it  is  also  shown  in  their  paper  that  any  such  test  set  for 
a  PLA  inherently  covers  (that  is,  detects)  most  of  its  single  stuck-at, 
bridge-type,  and  shorted-diode  faults  as  well.  Similar  results  are 
likewise  obtained  in  [3],  |4]. 

Nonetheless,  since  the  single  fault  model  does  not  account  for  all 
the  probable  failures  (7)  of  an  LSI  chip,  it  is  clear  that  for  high  reli¬ 
ability  applications,  PLA’s  must  be  tested  under  a  more  general 
multiple  fault  model.  But  it  is  equally  obvious  that  the  consideration 
of  various  multiple  faults  which  could  possibly  occur  in  a  PLA  would 
be  extremely  impractical.  For  example,  in  a  PLA  containing  16  input 
variables,  48  product  terms,  and  8  output  lines,  there  are  only  48  (2 
X  16  +  8)  =  1920  single  contact  faults,  but  extremely  large  2 1920 
multiple  contact  faults!  Thus,  as  in  the  case  of  combinational  net¬ 
works,  a  practical  compromise  in  such  a  situation  would  be  to  assume 
that  a  complete  single  contact  fault  test  set  for  a  PLA  covers  most  of 
its  multiple  contact  faults  as  well.  The  aim  of  our  paper  is  to  establish 
the  validity  of  this  assumption. 

II.  Contact  Faults  as  Stuck-At  Faults 

The  logic  implemented  by  a  PLA  can  be  very  conveniently  de¬ 
scribed  by  a  two-dimensional  array  of  0’s  and  1  ’s,  which  is  often  re¬ 
ferred  to  as  the  personality  of  the  PLA  [2],  The  personality  of  a  PLA 
which  we  will  consider  in  the  following  is  simply  a  representation  of 
whether  the  contact  between  a  row  and  a  column  of  the  PLA  is  a  0- 
contactora  I -contact,  where  0-contacts  and  I -contacts  are  defined 
by  the  following. 

Definition  I:  Given  a  row  and  an  output  column  in  a  fault-free 
PLA,  a  I  -contact  (O-contact)  will  be  said  to  exist  between  the  row  and 
the  column  if  the  link  between  the  two  is  intact  (correspondingly,  is 
not  intact,  that  is,  electrically  fused). 

Definition  2:  Given  a  row  and  an  input  column  in  a  fault-free  PLA, 
a  0-contact  ( 1  -contact)  will  be  said  to  exist  between  the  row  and  the 
column  if  the  link  between  the  two  is  intact  (correspondingly,  is  not 
intact). 

A  two-input  PLA  with  two  rows  and  two  output  functions  is  shown 
in  Fig.  1(a).  Using  the  above  definitions,  the  personality  shown  in  Fig. 
1(b)  is  obtained.  Finally,  Fig.  1(c)  illustrates  the  equivalent  logic  path 
of  this  PLA. 

We  now  formally  define  a  single  contact  fault. 

Definition  3:  A  single  0-contact  (I -contact)  fault  is  said  to  exist 
in  a  PLA  if  due  to  some  failure,  a  0-contact  (1 -contact)  of  the  fault- 
free  PLA  becomes  a  1 -contact  (0-contact)  in  the  faulty  PLA. 

Accordingly,  the  total  number  of  single  contact  faults  which  could 
possibly  occur  in  a  PLA  would  simply  be  the  number  of  entries  in  its 
personality.  For  a  single-input  decoder  PLA  with  n  inputs,  m  rows, 
and  p  output  functions,  this  number  is  easily  seen  to  be  m(2n  +  p). 
Moreover,  by  generalizing  Definition  3  to  multiple  contact  faults,  the 
total  number  of  multiple  contact  faults  of  a  given  size,  say  r  >1,  is 
seen  to  be  (m<2;+/,t)i  an(j  lota|  number  of  all  single  and  multiple 
contact  faults  is 

m(2n  +  p)j  _  2m(2„+p)  _  , 

Assume  that  Tc  uenotes  a  complete  single  contact  fault  detection 
test  set  for  a  PLA;  that  is,  Tr  is  such  that  for  each  single  contact  fault 
in  the  PLA,  there  exists  at  least  one  input  vector  in  Tr  for  which  the 
fault-free  output  of  the  PLA,  on  at  least  one  output  line,  is  different 
from  the  output  in  the  presence  of  the  fault.  Similarly,  let  Tm  denote 
a  complete  single  and  multiple  contact  fault  detection  test  set  for  a 
PLA.  As  pointed  out  in  the  previous  section,  the  generation  of  an 
optimal  Tm  is  a  rather  impractical  task.  Therefore,  our  concern  in 
this  paper  is  to  determine  the  capability  of  a  Tt  for  a  PLA  to  cover 
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its  multiple  contact  faults  as  well.  We  do  this  by  developing  a  com¬ 
binational  network  model  of  the  PLA  such  that  each  single  contact 
fault  in  it  is  functionally  equivalent  to  some  single  stuck-type  fault 
in  the  modeled  network.  This  model  will  be  referred  to  as  the  SAE 
network  (stuck-at  equivalent  network)  of  the  PLA. 

The  SAE  network  for  a  PLA  is  best  obtained  by  considering  con¬ 
tact  faults  on  input  and  output  columns  in  separate  manners.  Ac¬ 
cordingly,  consider  first  the  simple  and  network  shown  in  Fig.  2(a), 
and  assume  that  input  lines  x  and  y  correspond  to  some  input  columns 
of  a  PLA  and  the  gate  output  line  W  corresponds  to  some  row  of  the 
PLA.  Since  each  contact  on  lines  x  and  y  can  be  in  two  different  po¬ 
sitions,  there  are  four  different  “programming"  configurations  as¬ 
sociated  with  this  and  gate,  as  shown  in  Fig.  2(a).  To  be  able  to 
represent  a  contact  fault  as  a  stuck-at  fault,  we  will  assume  that  each 
contact  of  a  PLA  is  represented  by  an  input  Boolean  variable  in  its 
SAE  network,  and  that  its  value,  0  or  1  (as  determined  by  Definition 
2),  in  the  PLA  is  the  value  of  the  line  which  carries  that  Boolean 
variable  in  the  SAE  network.  More  particularly,  suppose  a,  is  a 
Boolean  variable  whose  value  is  0(1)  if  line  x  has  a  0-contact  (I- 
contact).  Let  by  be  similarly  defined  with  respect  to  liney.  With  the 
introduction  of  these  two  new  variables,  W  then  becomes  a  function 
of  x,  y,  ax,  and  by,  as  shown  in  the  A'-map  of  Fig.  2(b).  It  is  important 
to  note  here  that  ax  and  b}  do  not  depend  on  the  values  of  x  and  y, 
but  only  on  the  contacts  on  those  lines.  Therefore,  x,  y,  ax,  and  6,  are 
four  independent  variables.  Using  now  the  straightforward  Karnaugh 
map  technique,  we  get 

W=  (ax  + x)(by  + y).  (1) 

A  network  realizing  (1)  is  shown  in  Fig.  2(c). 

Comparing  the  network  of  Fig.  2(c)  with  those  of  Fig.  2(a).  we  note 
that  depending  on  the  values  of  ax  and  by,  the  former  network  can 
be  made  equivalent  to  any  one  of  the  four  "programming"  configu¬ 
rations  of  the  later  networks.  For  example,  for  the  top-left  configu¬ 
ration  of  Fig.  2(a),  the  equivalent  network  is  obtained  by  assigning 
ax  =  I  and  6,.  =  I.  More  importantly,  this  comparison  also  shows  that 
a  0-contact  (I -contact)  fault  on,  say  line  x  of  a  “programming” 
configuration,  can  be  represented  in  its  equivalent  network  by  simply 
assigning  a  stuck-at- 1  (stuck-at-0)  fault  to  line  ox.  In  other  words, 
we  have  shown  that  the  SAE  network  for  a  PLA’s  input  columns  can 
be  easily  obtained  by  replacing  each  and  gate  of  the  PLA  with  a 
corresponding  network  of  the  type  shown  in  Fig.  2(c).  This  replace¬ 
ment  in  the  example  PLA  of  Fig.  I  results  in  the  left  part  of  Fig 
4. 

Similar  to  the  above,  the  stuck-at  fault  representation  of  contact 
faults  on  the  output  columns  of  a  PLA  is  obtained  by  considering  the 
OR  gate  network  shown  in  Fig.  3(a).  Assume  here  that  input  lines  M', 
and  W'j  correspond  to  some  rows  of  a  PLA  and  the  gate  output  / 
corresponds  to  some  output  line  of  the  PLA.  Moreover,  as  above,  let 
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Fig.  2.  SAH  network  for  and  gate. 


(a) 


Fig.  3.  SAE  network  for  or  gate. 


C|  and  Cj  denote  Boolean  variables  whose  value  is  0(1 )  iflines  W\  and 
W 2 ,  respectively,  have  a  0-contact  (1-contact).  Then  the  A'-map  [see 
Fig.  3(b)J  for /in  terms  of  IF,,  W2,  c i,  and  c2  leads  to  the  function 

/=c,fF, +c2»F2  (2) 

whose  realization  is  shown  in  Fig.  3(c). 

Here  again,  one  easily  sees  that  depending  on  the  values  of  C|  and 
c 2,  each  of  the  four  “programming"  configurations  of  Fig.  3(a)  has 
an  equivalent  network  in  Fig.  3(c).  Also,  as  above,  a  0-contact  (I- 
contact)  fault  on,  say,  line  IF,  of  a  “programming”  configuration  is 
represented  in  its  equivalent  network  as  a  stuck-at-1  (stuck-at-0)  fault 
on  the  c i  line.  Based  on  these  developments,  the  SAE  network  for  the 
example  PLA  of  Fig.  1  finally  results  in  the  network  of  Fig.  4. 

Some  important  points  should  be  mentioned  here  before  we  use  this 
SAE  network  model  for  multiple  fault  coverage  analysis  in  the  next 
section.  First  of  all,  note  from  Fig.  4  that  although  the  SAE  network 
contains  fanout  both  at  primary  input  lines  and  internal  lines,  the 
internal  line  fanout  docs  not  reconverge.  Therefore,  when  seen  with 
respect  to  any  one  output  line,  the  corresponding  subnetwork  of  an 
SAE  network  is  clearly  an  internal  fanout-free  network  [12},  [13} 
consisting  only  of  three  levels  (because  now  levels  2  and  3  can  be 
merged  into  one  level). 

Second,  note  that  an  SAE  network  contains  two  types  of  primary 
inputs:  the  n regular  inputs x,.  x2.  .  Jr„.  and  m[2n  +/>)  contacts 

inputs  which  correspond  to  m(2n  +  p)  contacts  of  the  associated 
PLA.  In  the  fault-free  SAE  network,  each  of  these  m(2n  +  p)  contact 
inputs  is  assigned  a  permanent  0  or  1  depending  on  the  value  of  the 


corresponding  contact  in  the  PLA.  Thus,  it  should  be  clear  that  each 
contact  input  line  with  the  value  0(  I )  could  contain  only  one  fault  of 
interest,  namely,  sal  (saO),  and  that  the  other  fault,  saO  (sal),  on  that 
line  would  be  meaningless.  The  resulting  m(2n  +  p)  single  stuck-at 
faults  of  interest  on  these  m(2n  +  p)  contact  input  lines  would,  of 
course,  correspond  to  the  m(2n  +  p)  single  contact  faults  of  the  PLA. 
Moreover,  a  multiple  contact  fault  on  a  PLA  would  thus  be  repre¬ 
sented  by  a  multiple  stuck-at  fault  on  the  contact  input  lines.  For  the 
convenience  of  description  in  the  remainder  of  this  paper,  we  will, 
whenever  there  is  no  change  of  ambiguity,  use  the  terms’ “contact 
fault(s)"  and  "stuck-at  fault(s)  on  contact  input  lines”  interchan- 
gably. 

Finally,  it  is  important  to  note  that  although  an  SAE  network  of 
a  PLA  is  far  more  complex  than  its  equivalent  logic  network  [compare 
Figs.  4  and  1  (c)).  a  significant  difference  between  the  two  represen¬ 
tations  is  that  whereas  the  former  network  represents  the  complete 
PLA  structure,  the  latter  network  represents  only  the  logic  realized 
by  the  PLA. 

111.  Multiple  Contact  Fault  Coverage 

As  pointed  out  in  the  last  section,  each  SAE  network  output  line 
is  the  output  of  an  internal  fanout-free  combinational  network.  This 
observation  immediately  suggests  the  possibility  of  using  various 
previously  known  results  [8]- [  1 1  ]  about  multiple  stuck-at  fault 
coverage  in  internal  fanout-free  networks  to  determine  the  multiple 
contact  fault  coverage  in  PLA’s.  For  example,  it  is  well  known  that 
in  an  irredundant,  internal  fanoul-frcc  network,  every  multiple 
stuck-at  fault  of  size  2  and  3  is  covered  by  any  test  set  that  covers  ail 
the  single  stuck-at  faults  of  the  network.  Therefore,  we  immediately 
get  the  following  from  this  well-known  result. 

Lemma  I:  Each  Tr  of  an  irredundant  PLA  covers  every  multiple 
contact  fault  of  size  2  and  3. 

For  multiple  stuck-at  faults  of  sizes  greater  than  3,  it  is  shown  in 
[II]  that  the  greatest  lower  bound  on  the  capability  of  single  stuck-at 
test  sets  to  cover  multiple  faults  can  be  obtained  by  a  simple  table 
look-up  process.  Thus,  similar  to  Lemma  1,  this  result  could  be  used 
to  determine  the  greatest  lower  bound  on  the  multiple  contact  fault 
coverage  capability  of  a  Tc  in  a  PLA.  However,  we  now  show  in  the 
following  that  it  is  possible  to  obtain  better  multiple  contact  fault 
coverage  bounds  by  using  the  concept  of  masking  than  by  applying 
previously  known  results. 

Definition  4:  Given  a  row,  say  Wt,  and  an  input  variable,  say  Xj. 
let  the  contact  between  row  W,  and  column  jf,  be  denoted  by  a,j  and 
let  the  contact  between  row  IF,  and  column  Xj  be  denoted  by  b„  where 
a,,  and  by  e  |0, 1  j.  Then  Ay  =  (a,y  +  Xj)(b,j  +  x,)  will  be  referred  to 
as  an  interna!  variable  for  IF,  in  terms  of  x2. 

An  internal  variable  Ay  is  simply  a  function  of  two  contact  input 
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variables  a,,  and  b,,  and  one  regular  input  variable  x,.  Using  this 
definition,  the  product  function  realized  by  each  row  can  be  conve¬ 
niently  written  as 

W,  =  A,,Al2  A,„  (3) 

where  K  i  $  m. 

Similar  to  Definition  4.  let  the  contact  between  an  output  column 
ft.  £  k  £  p  and  a  row  W,  be  denoted  by  c*,  where  r*,  e  (0, 1).  Then 
the  output  ft,  can  be  written  as 

/*=a.^l+c*jH'J+  +ckmWm 

=  C*|(Al|A|2  A|„)  +  C42(A2|A22  A2„) 

■f  4-  c^m(Am\Am2  Amn).  (4) 

Definition  5:  A  single  stuck-at  fault,  say  «i,  is  said  to  be  masked 
by  another  single  stuck-at  fault,  say  a 2 ,  for  an  input  vector  X,  if  X, 
tests  at,  but  does  not  test  the  simultaneous  multiple  fault  ata2. 

With  the  help  of  above  definitions.  Lemmas  2  and  3  specify  all  the 
necessary  conditions  under  which  masking  lakes  place  in  a  PLA. 

Lemma  2:  A  0-contact  fault  in  a  row  of  a  PLA  can  be  masked  only 
by  a  I -contact  fault  in  the  same  row. 

Proof:  Let  a,  be  a  0-contact  fault,  say  on  row  1  and  column  x> 
That  is,  A\  \  =  (an  +xt)(bn  A-Xi)  where 0|,  =  Oand  the  fault  at 
causes  A n  to  become  (1  +  Ji)(b|i  +  *1)  Since  the  a,,  =  bjj  =  0 
combination  is  never  allowed,  we  must  have  that  bn  —  1.  Assume 
further  now  that  this  fault  «i  is  tested  by  an  input  vector  A",  e  Tc 
through  the  output  line,  say  f\.  Then  since 

ft  =  Cl!  fPl  +  (12^2  +  +  CimW'm 

and  W |  =  Ai\At2.  .  A\n. 

X,  can  test  at  iff  for  input  X,.  we  have 

=  =c]mWm  =  0 

c,,  =  1 

A\2  -  A\i  -  -  ■  -  -  Ai„  -  I 

and  An  =  (0  + X|)(l  +  X|)  =  0  for  the  fault-free  case  and  An  =  (1 
+  3f|)(l  +X|)  =  1  in  the  presence  of  ai. 

Let  Oj  be  any  other  single  contact  fault  in  PLA.  We  now  show  that 
unless  a2  is  a  1 -contact  fault  on  row  1,  it  cannot  mask  aj.  To  begin, 
suppose  that  a2  is  any  contact  fault  on  row  i,  i  1 .  Then  since  cufV, 
~  0  in  the  fault-free  case,  the  presence  of  a2  would  at  the  worst  make 
C|/  W,  =  1 .  But  this  would  imply  that  ft  =  0  in  the  fault-free  case  and 
ft  =  1  in  the  presence  of  the  fault  «ia2,  i.e.,  X,  would  also  test  the 
double  fault  aio2.  Thus,  a2  would  not  mask  a  1  if  a2  is  on  any  row  /, 

1  ^  1 . 

Suppose  next  that  a2  is  a  0-contact  fault  on  row  1.  If  thisa2  were 
on  row  1  and  any  one  of  the  output  columns  2, 3 .•••,/>,  then  it  is  clear 
that  the  presence  of  a2  woule  never  mask  the  testing  of  ai  through 
ft.  Similarly,  if  a2  were  on  row  1  and  any  input  column,  say  j,  then 
we  would  have  that  the  faulty  At,  =  1  and,  hence,  that  a2  does  not 
mask  «|. 

Thus,  we  have  shown  that  unless  a2  is  a  1 -contact  fault  on  row  1, 
it  cannot  mask  at.  Even  though  this  proof  is  given  only  for  the  case 
when  «i  is  on  an  input  column,  a  similar  proof  for  at  on  an  output 
column  can  be  easily  provided.  Q.E.D. 

Lemma  J:  A  1  -contact  fault  in  a  row  of  a  PLA  can  be  masked  only 
by  a  0-contact  fault  in  some  other  row. 

Proof:  The  lemma  can  be  proven  in  a  similar  manner  to  Lemma 

2. 

These  two  lemmas  are  important  because  they  specify  all  the 
conditions  under  which  one  contact  fault  may  be  masked  by  another 
contact  fault  in  a  PLA.  Using  these  lemmas,  the  proof  of  Lemma  1 
is  straightforward'!  1 5). 

Although  Lemma  I  ensures  that  each  Tr  covers  all  faults  of  sizes 

2  and  3,  a  multiple  contact  fault  of  size  4  or  larger  might  not  be  cov¬ 
ered  by  each  Tc  In  fact.  Fig.  5  contains  an  example  of  a  simple  PLA 
where  four  contact  faults  00,  or2,  at 3,  <*4  are  such  that  01  masks  04, 
«4  masks  a 3,  «i  masks  a2.  and  «2  masks  oi  .  Therefore,  the  multiple 
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4-way  masking  cycle 

Fig.  5.  PLA  with  four-way  masking  cycle. 

fault  aia2ajO(4  is  not  detected  by  the  four  input  vectors,  shown  in  Fig. 
5,  which  detect  the  single  contact  faults  ai,  a2,  a2,  and  04.  A  similar 
observation  about  multiple  stuck-at  faults  of  size  4  is  well  known  in 
the  literature  (9J,  (1 1], 

This  phenomenon  of  four-way  masking  cycle,  which  requires  two 
pairs  of  a  0-contact  fault  and  a  1-contact  fault  on  two  separate  rows, 
is  very  crucial  to  the  undelectabiliiy  of  a  multiple  contact  fault  by  a 
77  in  an  irredundant  PLA. 

Theorem  I:  Every  multiple  contact  fault  in  an  irredundant  PLA 
is  detected  by  any  Tc  provided  the  multiple  fault  does  not  contain  pairs 
of  a  0-conlact  fault  and  a  1 -contact  fault  on  more  than  one  row. 

Proof:  Clearly,  if  a  multiple  fault  indeed  contains  pairs  of  a  0- 
contact  fault  and  a  1  -contact  fault  on  only  one  row,  say  /,  then  every 
other  row  in  the  PLA  which  contains  component  single  contact  faults 
of  the  multiple  fault  must  contain  either  all  the  0-conlact  faults  or 
all  the  I -contact  faults. 

Now  since  row  1  contains  a  0-contact  and  a  1 -contact  fault  pair, 
it  might  be  possible,  by  Lemma  2,  that  the  0-contact  fault  is  masked 
by  the  1 -contact  fault.  Moreover,  by  Lemma  3.  this  1 -contact  fault 
on  row  ;  might,  in  turn,  be  masked  by  some  other  0-contacl  fault  in 
some  other  row,  say  j  A  i.  However,  since  by  our  assumption  row  j 
does  not  contain  any  1 -contact  fault,  it  is  clear  from  Lemma  2  that 
this  0-conlact  fault  cannot  be  masked  by  any  single  contact  fault 
component  of  the  given  multiple  fault.  In  fact,  it  can  furthermore  be 
seen  from  the  proof  of  Lemma  2  that  this  0-contact  fault  cannot  even 
be  masked  by  any  multiple  contact  fault  component  of  the  given 
multiple  fault.  Therefore,  any  test  in  Tc  which  tests  this  0-contact  will 
test  the  given  multiple  contact  fault  as  well.  Q.E.D. 

Although  the  condition  in  Theorem  1  is  only  a  sufficient  condition, 
it  is  easy  to  show  that  most  multiple  faults  of  interest,  such  as  faults 
of  size  8  and  less  |7).  are  included  in  this  condition.  We  begin  by 
showing  that  Theorem  1  implies  that  most  multiple  contact  faults  of 
size  4  are  covered  by  each  Tc. 

Theorem  2:  Out  of  the  total  (m<J<"+r>)  different  contact  faults  of 
size  4  in  an  irredundant  PLA,  at  the  most  (")  -  (n  +  p/2)*  faults  are 
not  covered  by  every  Tr  of  the  PLA. 

Proof:  By  Theorem  1 ,  every  multiple  fault  of  size  4  which  docs 
not  form  the  four-way  masking  cycle  is  bound  to  be  covered  by  each 
T,  .  So  Theorem  2  would  immediately  follow  if  we  show  that  the 
maximum  number  of  contact  faults  of  size  4  with  four-way  masking 
is  (?)-(«  +P/2)*. 

Let  «|0'2orr0'4  be  one  such  contact  fault.  Since  it  must  exist  on 
exactly  some  2  of  the  m  rows  of  the  PLA,  there  arc  (”)  different  pairs 
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of  two  rows  which  are  possible  locations  of  Let  W'( ,  and 

W',j  be  one  of  these  (f)  pairs,  and  assume  that  «i  and  «j  exist  on 
and  or 3  and  a4  exist  on  Wi2.  The  total  number  of  contacts  on  each  row 
is.  of  course.  2n  +  p.  l.ct  the  number  of  0-contacts  and  I -contacts, 
respectively,  on  row  IF,,  be  s  i  and  2n  +  p  —  s | .  and  on  row  Wl2  be 
s2  and  2n  +  p  —  s2  where  s  i  >  0  and  s2  >  0.  Assuming  now  that  ai 
and  at  are  O-contact  faults  and  a?  and  o4  are  1  -contact  faults,  we  note 
that  all  the  different  ways  of  having  such  ct|.  as,  oj,  and  a4  on  Wn 
and  lf',2  are 

|s,|  |2n  +  p  -  s,|  |s2|  |2n  +  p  -  s2 

=  s,(2n  +  p  -  s,)(s2)(2tt +p  -  s2).  (5) 


Clearly,  the  maximum  value  of  the  expression  in  (5)  results  when 


S|  =  S;  = 


2n  +  P  -  „.P 
2  ”  ^  2 


It  thus  follows  that  the  maximum  number  of  contact  faults  of  size 
4  with  four-way  masking  is  (2)  •(/!  +  p/2)4.  Q.E.D. 

For  m  =  48,  n  =  16,  and  p  =  8,  the  number  (")  ■  (n  +  p/2)4  is  a 
mere  0.03  percent  of  the  total  (m'2J 4p)).  Thus,  99.97  percent  of  all 
multiple  contact  faults  of  size  4  are  bound  to  be  covered  by  each  Tc 
in  a  typical  irredundant  PLA. 

Theorem  2  can  now  be  generalized  for  multiple  faults  of  larger 
sizes,  as  follows.  For  the  proof  of  this  theorem,  see  [  1 5J. 

Theorem  3:  Out  of  the  total  (ml2"+/’>)  r  >4, different  faults  of  size 
r  in  an  irredundant  PLA.  at  the  most  (?*>-(/»  +  p/2)4  • 
faults  are  not  covered  by  every  Tc  of  the  PLA. 

Note  that  the  bound  given  in  Theorem  3  is  simply  an  upper  bound. 
In  fact,  for  large  values  of  r.  the  bound  (7)  -(n  +  p/2)4  •  (m(uff/~4) 
becomes  greater  than  However,  for  small  values  of  r. 

Theorem  3  is  very  convenient  to  use.  For  example,  when  m  =  48,  n 
=  !6.  and  p  -  8.  Theorem  3  shows  that  more  than  98  percent  of  all 
multiple  contact  faults  of  size  8  and  less  in  an  irredundant  PLA  are 
covered  by  each  Tr  of  the  PLA.  This  result  becomes  very  significant 
in  the  light  of  a  recent  paper  [7],  wherein  it  is  shown  that  the  con¬ 
sideration  of  faults  of  size  8  and  less  is  sufficient  for  most  LSI  chips. 
Thus,  in  conclusion,  we  have  shown  that  even  though  contact  faults 
in  PLA's  exhibit  the  well-known  masking  phenomenon,  yet  more  than 
98  percent  of  all  multiple  contact  faults  of  interest  are  covered  by  their 
single  contact  fault  test  sets. 


IV.  Conclusion 

The  problem  of  multiple  fault  coverage  by  single  contact  fault  test 
sets  in  PLA's  has  been  considered  in  this  paper.  Since  a  PLA  is  con¬ 
ceptually  a  two-level  network,  it  may  seem  that  each  complete  single 
contact  fault  lest  set  for  a  PLA  must  inherently  cover  all  of  its  multiple 
contact  faults  as  well.  However,  in  Section  II  an  SAE  (stuck-at 
equivalent)  network  of  a  PLA  was  developed  to  show  that  such  is  not 
the  case  in  general.  Nonetheless,  it  is  also  shown  by  using  SAE  net¬ 
works  that  more  than  98  percent  of  all  multiple  contact  faults  of  in¬ 
terest  are,  indeed,  covered  by  every  single  fault  test  set  in  an  irre¬ 
dundant  PLA. 

The  results  mentioned  above  have  been  developed  for  and  OR 
PLA's  with  single  input  decoders  only.  However,  similar  results  for 
NOR  NOR  PLA's  with  two  input  decoders  are  easily  derived.  More 
specifically,  the  personality  for  a  NOR  NOR  PLA  is  determined  by 
using  Definition  I  for  both  input  and  output  columns.  Moreover,  since 
a  NOR  gale  is  an  OR  followed  by  a  NOT,  the  SAE  network  for  such 
a  PLA  is  obtained  by  using  the  network  of  Fig.  3.  Finally,  an  internal 
variable  A,t  is  then  defined  as  A,2  =  a,jXjXJ+ 1  +  b^XjXj* 1  +  c,jXjX)+  ( 
+  d,jXjXj+ 1  where  1  =  1, 2,  ,m  andy  =  1, 3,  , n  —  l.Using  these 

formulations,  results  similar  to  Lemmas  1-4  and  Theorems  1-3  of 
Sections  II  and  III  can  be  easily  proven. 

In  the  above  analysis,  we  had  tacitly  assumed  that  each  single 
contact  fault  of  a  PLA  is  detectable.  But.  in  practice,  most  PLA's  tend 
to  contain  various  undetectable  faults.  The  multiple  fault  coverage 
capability  of  such  PLA's,  by  their  single  fault  test  sets,  is  generally 
reduced.  This  claim  is  best  illustrated  by  considering  a  double  contact 
fault,  say  a  0-contact  fault  and  a  I -contact  fault,  on  some  row  of  a 


PLA.  By  Lemmas  2  and  3.  we  then  know  that  even  if  the  I -contact 
fault  masked  the  0-contacl  fault,  the  converse  is  never  true.  Thus,  the 
double  fault  is  sure  to  be  covered  by  a  complete  single  fault  test  set 
as  long  as  the  1  -contact  fault  is  detectable.  However,  if  the  1  -contact 
fault  is  undetectable  and  it  masks  the  0-contact  fault,  then  the  double 
fault  might  not  be  detected  by  a  complete  single  fault  test  set. 

The  above  illustration  can  similarly  be  carried  out  for  multiple 
faults  of  larger  sizes.  Thus,  it  seems  evident  that  the  multiple  fault 
coverage  capability  of  a  Tc  in  a  redundant  PLA  will  be  highly  de¬ 
pendent  on  the  personality  of  the  PLA  and  the  redundant  contacts. 
In  other  words,  no  general  results  like  Theorems  I  to  3  of  the  last 
section  can  be  specified  for  redundant  PLA's.  Nonetheless,  one  easily 
notices  from  the  proof  of  Theorem  1  that  any  multiple  contact  fault, 
which  contains  at  least  one  detectable  0-contact  fault  component  in 
any  row  containing  only  0-contact  fault  components,  would  always 
be  covered  by  each  Tc.  Further  work  along  these  lines,  however,  needs 
to  be  done. 

Finally,  the  advantage  of  the  SAE  network  models  of  PLA’s  in 
areas  other  than  the  multiple  contact  fault  coverage  problem  is  worth 
mentioning  here.  Note,  for  instance,  that  a  Tc  is  basically  an  in¬ 
complete,  but  very  specific  single  stuck-at  fault  test  set  for  a  SAE 
network.  Thus,  by  taking  into  account  the  explicit  structure  possessed 
by  all  SAE  networks,  it  can  be  easily  proven  that  a  Tc  in  general  will 
cover  most  of  the  single  stuck-at  faults  of  a  SAE  network.  Moreover, 
it  will  then  immediately  follow  that  each  Tc  also  covers  most  stuck-at 
faults  of  sizes  up  to  8  [  1 1  ].  In  short,  a  Tc  for  a  PLA  can  be  considered 
a  very  effective  test  set  for  all  single  and  multiple  contact  and  stuck-at 
faults  of  interest. 
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Abstract — Given  any  combinational,  internal  fan-out-free  network 
and  any  complete  single  fault  detection  test  set  (SFDTS)  for  the  net¬ 
work,  we  consider  in  this  paper  the  problem  of  determining  the  minimal 
extent  to  which  that  SFDTS  will  cover  multiple  faults  in  the  network. 
The  basis  of  our  approach  is  the  development  of  a  generic  perspective 
to  multiple  faults  which  uses  a  representation  of  such  faults  called  an 
L-expression.  This  perspective  leads  to  a  technique  for  obtaining  the 
greatest  lower  bound  on  the  multiple  fault  coverage  capability  of  an 
SFDTS  by  means  of  a  simple  table  look-up  process.  In  addition  to 
generalizing  previously  known  results  regarding  multiple  fault  cover¬ 
age,  two  particularly  interesting  results  obtained  from  this  approach 
are  as  follows: 

1)  On  the  average,  every  SFDTS  for  an  internal  fan-out-free  net¬ 
work  covers  92  percent  of  all  multiple  faults  of  sizes  8  and  less. 

2)  On  the  average,  every  SFDTS  for  an  internal  fan-out-free  net¬ 
work  covers  at  least  46.1  percent  of  all  multiple  faults. 

Index  Terms— Coverage  bounds,  coverage  table,  fault  vectors, 
generic  representations,  L-expressions,  internal  fan-out-free  networks, 
single  and  multiple  fault  detection. 

I.  Introduction 

THE  increasing  density  of  logic  on  integrated  circuit  chips 
together  with  the  emerging  LSI  design  practice  [23)  of 
implementing  sequential  networks  in  the  form  of  register  to 
register  combinational  logic  have  led  to  a  renewed  interest  in 
various  unsolved  problems  associated  with  the  detection  of 
stuck-type  faults  in  combinational  networks.  Perhaps  the  most 
basic  of  such  problems  is  the  generation  of  tests  which  effi¬ 
ciently  detect  the  presence  of  multiple  faults  in  a  given  com¬ 
binational  network.  The  complexity  of  this  problem  stems  from 
the  fact  that  in  a  network  of  p  lines,  there  are  3f-l  different 
multiple  faults  which  might  exist.  To  combat  this  prohibitively 
large  number  of  faults,  various  concepts  such  as  functional 
equivalence  [24]  and  fault  collasping  [25)  have  been  devel¬ 
oped.  Testing  algorithms  [26],  [27]  based  on  these  concepts 
show  that  instead  of  considering  3M  multiple  faults,  it  is 
possible  to  consider  a  smaller  number  of  faults  while  still  being 
able  to  detect  the  presence  of  any  of  the  3/>- 1  faults.  However, 
even  the  use  of  these  algorithms  does  not  provide  a  satisfactory 
solution  for  large  networks. 

A  practical  compromise  often  employed  to  alleviate  this 
multiple  fault  detection  problem  is  to  use  a  complete  single 
fault  detection  test  set  (SFDTS)  for  a  network  to  cover  its 
multiple  faults  as  well.  However,  again  because  of  the  large 
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number  of  multiple  faults  involved,  this  compromise  has  had 
little  theoretical  justification  or  investigation  other  than  the 
establishment  of  such  results  as  an  SFDTS  is  a  multiple  fault 
detection  test  set  (MFDTS)  for  two-level  networks  or  every 
SFDTS  in  an  internal  fan-out-free  network  covers  all  multiple 
faults  of  sizes  2  and  3  [  10]- [13].  Motivated  by  the  above,  the 
authors  previously  introduced  the  concepts  of  the  functional 
form  of  multiple  faults  [9]  and  of  recursive  coverage  projection 
of  test  sets  [10]  to  theoretically  investigate  more  general  sit¬ 
uations.  These  developments  led  to  interesting  extensions  of 
previously  known  results  [1 1  ] - [  1 3]  and  new  characterizations 
of  some  special  cases  of  multiple  faults.  In  this  paper  we  con¬ 
tinue  this  work  by  developing  a  general  approach  to  quanti¬ 
tatively  evaluate  the  validity  of  the  common,  practical  com¬ 
promise  mentioned  above.  This  approach  leads  to  a  technique 
for  obtaining  the  greatest  lower  bound  on  the  multiple  fault 
coverage  capability  of  a  SFDTS  on  a  set  of  lines  by  means  of 
a  simple  table  look-up  process.  Two  particularly  interesting 
results  obtained  using  this  table  are  as  follows. 

1 )  On  the  average,  every  SFDTS  for  an  internal  fan-out- 
free  network  covers  92  percent  of  all  multiple  faults  of  sizes 
8  and  less. 

2)  On  the  average,  every  SFDTS  for  an  internal  fan-out- 
free  network  covers  at  least  46.1  percent  of  all  multiple 
faults. 

II.  Network  Model  and  Notational 
Preliminaries 

A.  Network  Model 

For  multiple  fault  detection  purposes,  it  is  convenient  to 
classify  a  single  output  combinational  switching  network  into 
one  of  the  following  three  classes; 

Cl:  Class  of  Fan-Out-Free  Networks  [/]-[J],  \28\-.  A 
network  in  which  no  line  fans  out  to  two  or  more  lines  is  re¬ 
ferred  to  as  a  fan-out-free  network  [see  Fig.  1(a)]. 

C2:  Class  of  Internal  Fan-Out-Free  Networks  [#]-[//]: 
In  an  internal  fan-out-free  network,  fan-out  is  allowed  on  the 
primary  input  lines  only  [see  Fig.  1(b)]. 

C3:  Class  of  Reconuergent -Fan-Out  Networks  [I2]-\IS]'. 
A  network  from  this  class  can  have  fan-out  on  any  line  [see  Fig. 
1(c)]. 

Clearly,  Cl  C  C2  c  C3.  The  reason  this  classification 
scheme  is  centered  around  the  amount  of  fan-out  present  in 
a  network  is  that  fan-out  is  the  single  most  important  factor 
which  affects  the  complexity  of  generating  test  sets  for  net¬ 
works.  Indeed,  networks  of  Class  Cl  are  the  simplest  to  test 
and  networks  of  Class  C3  are  the  most  difficult.  In  fact,  very 
simple  and  useful  methods  to  generate  MFDTS's  for  networks 
of  Class  Cl  are  well  known  [4}-[7]  in  the  literature.  On  the 
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Fig.  I.  Illustration  of  three  classes  of  networks. 


other  hand,  it  is  equally  well  known  that  the  generation  of  an 
M  FDTS  for  a  complex  network  of  Class  C3  can  be  a  prohib¬ 
itively  difficult  task. 

In  terms  of  the  capability  of  each  of  these  classes  of  networks 
to  realize  Boolean  functions,  it  is  known  [1],  [3]  that  functions 
which  can  be  realized  by  Class  Cl  are  very  limited.  However, 
it  can  be  shown  that  Classes  C2  and  C3  are  complete  in  the 
sense  that  every  Boolean  function  can  be  realized  by  a  network 
of  Class  C2  and,  therefore,  also  of  Class  Ci.  The  advantage 
of  Class  Ci  networks  over  Class  C2  networks  is,  of  course,  that 
the  former  type  of  networks  require  fewer  gates  for  the  reali¬ 
zation  of  same  functions.  However,  since  Class  C2  is  a  com¬ 
plete  class  in  the  sense  just  mentioned  above  and  since  Class 
C2  net  works  are  easier  to  manipulate  than  Class  Ci  networks 
from  a  multiple  fault  detection  point  of  view,  we  will  restrict 
our  considerations  in  this  paper  to  networks  of  Class  C2  only. 
This  is  not  a  severe  limitation  because  each  network  of  Class 
Ci  can  be  converted  to  a  network  of  Class  C2  [  1 3] .  Hence,  our 
results  can  be  exploited  for  the  networks  of  Class  Ci  as 
well. 

Thus,  let  A'  denote  an  internal  fan-out-frec  network  such 
as  that  show  n  in  Fig.  1(b).  For  the  convenience  of  analysis  in 
this  paper,  we  will  make  two  simplifying  assumptions  on  the 
line  faults  which  can  exist  in  such  networks.  First  of  all,  since 
a  stem  fault  (such  as  the  one  shown  in  Fig.  1(b)]  is  functionally 
equivalent  to  some  branch  faults  (lines  2  and  3  s-a-l),  we  will 
assume  that  each  multiple  fault  under  consideration  is  such 
that  it  does  not  include  any  stem  faults.  Second,  since  two  or 
more  simultaneous  single  faults  (that  is,  one  multiple  fault) 
on  the  same  path  from  a  primary  input  to  the  primary  output 
are  collectively  functionally  equivalent  to  the  single  fault 
closest  to  the  primary  output  on  that  path,  we  will  assume  that 
each  multiple  fault  under  consideration  is  such  that  it  has  at 
the  most  one  single  fault  on  each  such  path.  It  should  be  clear 
that  these  two  assumptions  do  not  in  any  way  reduce  the 


number  of  effective  multiple  faults  under  consideration.  The 
implications  of  these  two  assumptions  will  be  soon  made  ap¬ 
parent. 

B  Rotational  Preliminaries 

To  introduce  the  notational  preliminaries  used  in  the  paper, 
consider  again  the  internal  fan-out-free  network,  R,  shown  in 
Fig.  1  (b).  Let  K  —  |2, 4, 6,  8)  be  a  set  of  4  lines  in  A/.  A  simul¬ 
taneous  multiple  stuck-type  fault  involving  all  the  four  lines 
of  K  will  be  denoted  as  a  binary  fault  vector  er  A  =  («2,  «4,  o<„ 
an),  where  a,«|0,  I),  nK.  and  i  is  an  index  parameter.  Clearly, 
there  can  be  24  different  such  fault  vectors  on  A.  The  set  of 
these  24  fault  vectors  on  A  will  be  denoted  by  Aa  =  l«J..  « A. 
•  ,  «A|  and  referred  to  as  the  fault  complex  on  K.  Whenever 

possible,  we  will  write  «A  instead  of  o'A  to  denote  a  general 
element  of  FA. 

Next,  let  Yy  =  xi.  Yq  =  xy,  Y(,  =  x$.  and  T*  =  X7  be  the 
Boolean  functions  realized  by  lines  2,  4, 6,  and  8,  respectively. 
These  K's  will  be  referred  to  as  line  variables  to  differentiate 
them  from  the  primary  input  variables.  The  primary  output 
function  of  N  in  terms  of  the  primary  input  variables,  X],  xj. 

■  ,  Xy,  and  the  line  variables,  Y 2.  K4.  Yt>.  Yx.  will  be  written 
as  Z\X\  VA],  where 

Z[X.  Yk]  =  (T)  Y-y  +  X2K4)(x4}'A  +  TfcFs).  (2.1) 

}  A  =  |  }'2.  }'4.  Yf,.  Kr|  and  X  =  |.\  |,  x2.  .  *7).  Given  a  specific 

input  vector,  denoted  as  X,.  the  Boolean  output  of  R  will  then 
be  Z[Xj\  )  A]  when  R  is  fault-free,  and  Z[A',;  oA]  when  a  «A 
e  Fk  exists  in  R. 

Note  that  each  Y,  e  YK  appears  exactly  once  in  ZlA';  )'A]. 
In  fact,  because  of  the  two  assumptions  made  earlier  about  the 
multiple  faults  in  internal  fan-out-free  networks,  it  should  be 
clear  that  any  set  A'  of  lines  we  consider  will  always  be  such 
that  Z\X\  >'A  ]  can  be  expressed  in  a  form  wherein  each  Y,  q 
Yk  occurs  exactly  once  in  the  expression,  either  as  Y,  or  Y,. 
Such  a  form  is  referred  to  as  the  fan-out-free  form  (3].  In  the 
following  each  Z[A';  KA]  will  always  be  assumed  to  have  been 
expressed  in  the  fan-out-free  form. 

Definition  I :  Two  sets,  say,  /  and  J  are  said  to  form  a  par¬ 
tition  of  K  if  /  and  J  are  disjoint  sets  of  lines  in  R  such  that  / 
u  J  =  A'  and  there  exist  Boolean  functions,  say,  W|A';  T/], 
4/(A’;  Yj).  and  0(A]  so  that  Z|A';  >'A  ]  =  (H |A';  >  /]*Af|A'; 

)  2])  +  OJA'],  where  *  denotes  Boolean  OR,  “+”,  or  Boolean 
AND. 

Note  that  because  of  the  fan-out-free  form  of  Z(A-;  FAJ, 
each  A  will  always  have  at  least  one  partition.  For  instance, 
in  our  previous  example,  /  =  (2, 4j  and  J  =  |6, 8]  form  a  parti¬ 
tion  of  A  with  H\X;  Y,)  =  (x,  >'2  +  x2y4).  M\X\  Yj]  =  (x4>'6 
+  x6 )'((),  and  0[X]  =  0.  Using  this  concept  of  partition  in  a 
recursive  manner,  it  is  possible  to  define  an  ordering  on  any 
set  of  lines  in  R.  More  particularly,  suppose  that  sets  J  and  J 
were  some  ordered  sets  /  =  (/ 1, 12,  ,  ip)  and  J  =  (j\.ji. 

jq).  Then  the  set  A'  of  which  /  and  J  form  a  partition  would  be 
ordered  as  A  =  (i|.  i  2.  .  ip.jt.j2.  .jq)-  With  this  ordering 

on  A,  it  is  possible  to  write  Z[A';  TA]  such  that  each  Y,  e  )'A 
appears  in  Z|Af;  KA]  in  the  same  order  (from  left  to  right)  as 
the  corresponding  l  appears  in  A.  For  a  given  A,  such  an  or¬ 
dered  Z[X\  Yk)  will  be  unique  except  for  variations  possible 
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because  of  the  associative  and  commutative  properties  of  “+" 
and 

Another  advantage  of  the  ordering  scheme  is  that  the  fault 
complex  on  A  can  be  very  conveniently  expressed  in  terms  of 
fault  complexes  on  its  partition  sets.  In  particular  note  that  if 
/,  J,  and  A'  were  ordered  as  mentioned  above,  then  the  Car¬ 
tesian  product  F/  X  Fj  will  be  isomorphic  to  F* .  Thus,  for 
notational  convenience  we  will  write  that  F*  =  F/  X  Fj 
whenever  /  and  J  form  a  partition  of  A.  More  generally,  if  A 
s  Fj  and  B  e  Fj.  A  X  B  will  represent  a  subset  of  F*. 

C.  Motivation 

Let  T,  be  an  SFDTS  for  N,  A  be  a  set  of  lines  in  A',  and 
suppose  that  we  want  to  determine  all  the  fault  vectors  of  F* 
which  are  covered  by  T,.  Since  a  <**  e  F*  is  covered  by  an  X, 
iff  Z[X,;  Yk J  tA  Z[ Xj\  a*],  it  is  clear  that  any  method  used 
to  solve  this  problem  would  directly  or  indirectly  need  all  the 
information  contained  in  Z[X\  Y K ]  However,  because  Z\X; 
Yk  ]  is  inherently  dependent  on  the  particular  A  and  A  under 
consideration,  it  is  equally  obvious  that  all  such  methods  are 
bound  to  be  impractical  for  any  general  use.  Thus  our  moti¬ 
vation  is  to  develop  a  method  which  will  utilize  the  minimal 
possible  information  about  A.  N,  and  Ts.  and  still  be  able  to 
determine  some  reasonable  information  about  the  capability 
of  T,  to  cover  F*.  It  will  be  shown  in  this  paper  that  it  is  pos¬ 
sible  to  determine  the  greatest  lower  bound  on  the  number  of 
fault  vectors  of  F*  covered  by  any  SFDTS  for  N  simply  by  a 
table  look-up  process.  In  other  words,  the  information  used 
about  A  and  N  will  be  such  that  it  can  be  tabularized  for 
general  use.  However,  as  a  consequence  of  not  using  complete 
information  about  Z[X\  K*],  we  will  get  only  greatest  lower 
bounds  on  coverage. 

The  main  concept  developed  to  obtain  our  results  is  that  of 
the  generic  representation  of  logical  interconnection  of  line 
variables.  To  first  illustrate  what  is  meant  by  the  logical  in¬ 
terconnection  of  line  variables,  consider  again  the  set  A  =  {2, 
4,  6,  8|  of  lines  in  N  of  Fig.  1(b),  wherein  Z\X\  Yk]  =  (xjK2 
+  JC2K4KX4K6  +  Xf, Fg).  Given  any  two  line  variables,  such  as 
Y 2  and  K4,  when  we  refer  to  their  logical  interconnection  we 
mean  the  logical  AND  or  logical  OR  operation  which  occurs 
between  them  in  the  fan-out-free  form  of  Z\X\  Yk]-  Thus,  K2 
and  Ya  are  logically  ORed,  and  so  are  K6  and  Kg.  But  Y2  and 
Yf,  are  logically  ANDed,  as  are  K4  and  K<„  K2  and  Kg,  and  K4 
and  Kg.  This  describes  the  complete  logical  interconnection 
of  all  the  line  variables  for  the  given  A,  and  can  be  succinctly 
restated  as  follows:  K2  and  K4  arc  logically  ORed  and  the  two 
together  are  logically  ANDed  with  the  logically  ORed  K6  and 
Kg.  As  another  example,  let  Z[X\  K*]  =  (xi  K2  +  X3-V4K3)  for 
A  =  |2,  3|  in  some  N.  Note  that  even  though  Z[X\  K*]  is 
written  in  the  fan-out-free  form,  the  logical  interconnection 
between  K2  and  K3  is  that  of  NOR,  which  is  not  included  in  our 
definition.  By  using  the  Demorgan  s  Law,  however,  Z[X:  Yk  J 
can  be  rewritten  as  (jc  1  +  K2MX3  +  *4  +  K3),  indicating  that 
the  logical  interconnection  between  K2  and  K3  is  that  of  and. 
Thus,  to  use  our  concept  it  will  be  ?  umed  in  the  following  that 
Z\X:  Yk  ]  is  always  written  such  that  there  is  no  complemen¬ 
tation  operation  involving  two  or  more  line  variables.  Also  note 
from  the  above  two  examples  that  this  concept  of  logical  in¬ 


terconnection  of  line  variables  is  not  concerned  with  whether 
or  not  a  line  variable  is  complemented,  or  whether  primary 
input  variables  arc  ANDed  or  ORed  with  a  line  variable. 

Given  the  information  on  the  logical  interconnection  of  line 
variables  for  a  set  A  in  N,  by  the  generic  representation  of  this 
information  we  mean  a  representation  in  which  no  information 
is  kept  of  the  particular  line  variables  involved  but  only  of  how 
these  line  variables  are  logically  interconnected.  Thus,  the 
generic  representation  in  the  case  of  Z\X\  Y k\  =  (S1K2  + 
X2Ya)(xaY(,  +  3?6Kg)  is  that  there  are  two  logically  ANDed 
pairs  of  two  logically  ORed  line  variables  each.  We  will  show 
in  the  next  section  that  such  generic  representations  of  any  set 
of  lines  in  any  network  can  be  conveniently  expressed  in  terms 
of  /-expressions. 

III.  L- Expressions  and  Generic  Representation 
A.  L-Expressions 

An  /.-expression  is  an  algebraic  expression  consisting  of  a 
given  number  of  “L"  symbols  which  are  interconnected  by 
commutative  and  associative  “+”  and  operators  and  pa¬ 
rentheses.  More  formally,  we  define  /.-expressions  as  fol¬ 
lows: 

Definition  2: 

a)  (/)  is  an  /.-expression. 

b)  If  (P)  and  ((?)  are  /.-expressions,  so  are  ((F)  +  ((?)) 
and  ((F)  •  (Q)) 

c)  Nothing  else  is  an  /-expression  unless  its  being  so 
follows  from  a  repeated  use  of  a)  and  b). 

Two  /-expressions  are  said  to  be  “equal"  or  “same"  if  one 
can  be  made  identical  to  the  other  by  using  commutative  and 
associative  properties  of  “+"  and  namely, 

(F)  +  (Q)  =  ((?)  +  (F) 

(P)-(Q)  =  (0-(F) 

(F)  +  ((G) +  (#?))  =  ((F) +  (0))  +  (K) 

(F)-  ((G)  •(*))=  ((F)  -(G))  •(/?). 

Thus,  for  example,  ((/)  +  (/))•  (/)  is  the  same  as  (/)  •  ((/) 
+  (/)).  For  the  sake  of  convenience,  the  use  of  parentheses  in 
writing  /-expressions  will  be  minimized  whenever  there  is  no 
confusion.  Moreover,  we  will  replace  all  the  sign  appear¬ 
ances  by  simple  concatenations.  Using  these  conventions,  all 
/-expressions  consisting  of  4  and  less  “ L "  symbols  are  showm 
in  Table  1. 

Let  /(e)  denote  the  set  of  all  /-expressions  each  of  which 
consists  of  v  ”L”  symbols.  Each  clement  of  /(c)  will  be  referred 
to  as  an  /-expression  of  size  o,  and  denoted  as  /,'.,  i  =  1.2,, 
|/(e)|.  or  simply  as  /,.  whenever  possible.  Note  now  from 
Table  I  and  Definition  2  that  for  all  v  >  I ,  there  is  an  inherent 
duality  among  all  /-expressions  of  L(v).  More  particularly, 
we  have  that  if  /,  =  Lr  +  Lq  (or,  L,  =  LpLq),  for  some  Lr  e 
L(p)  and  Lq  e  L(q).  p  A  q  =  v.  then  there  exists  the  dual  L- 
expression,  denoted  Lf.  of  Z,  in  /(c)  such  that  Zf  =  LpLdq 
(correspondingly,  Ld  —  LpA  Ldq).  where  of  course  Ld  e  L(p) 
and  Ld  e  L(q)  are  the  dual  /-expressions  of  Lp  and  Lq.  re¬ 
spectively.  An  /-expression  /,  will  be  said  to  be  essentially- 
and  (essentially-OR)  if  /,  =  LpLq  (/,  =  Lp  +  /,)  for  some 
Lp  and  Lq.  Clearly,  each  /-expression  is  either  cssentially-AND 
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1*1 
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III  LU«L* 

i*  1*1  l»U 

4 
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<1*1*01  H*U>  L 

in ♦  l  mum 

#t*l»  ll*  l» 

ll*U 

or  esscntially-OR  but  not  both,  and  the  dual  of  an  csscntially- 
AND  (esscntially-OR)  /.-expression  is  an  essentially-OR  (es- 
sentially-AND)  /.-expression. 

We  end  this  subsection  with  the  following  two  lemmas  about 
the  cardinality  of  the  set  L(v).  For  more  details  about  the 
enumeration  of  different  /.-expressions  the  reader  is  referred 
to  117]  and  [9]  wherein  it  is  shown  that  the  enumeration  of 
|  /.(f)  |  is  the  same  as  the  enumeration  of  two-terminal  se¬ 
ries-parallel  networks  [  1 8]— [20].  |Z(r)|.  for  all  v  <  10,  is 
listed  in  Table  II. 

Lemma  I:  Given  any  set  L(v),  v  >  2,  there  are  exactly  j  Z(r 
—  1)|  essentially-AND  (essentially  OR)  /.-expressions  in  Z(r) 
each  of  which  can  be  written  as  /,._,/  (correspondingly,  as 
Z.,-i  +  L),  where  /,_,</( v  —  1). 

The  proof  of  this  lemma  follows  from  Definition  2. 

Lemma  2:  For  larger.  |Z(r)|  —  (0.43)  X  (3.56) *'  X  r-3/-. 
Thus,  |Z(r-  I)]  =*  0.28  \L(v)\- 

The  proof  of  this  lemma  can  be  found  in  [19], 

B  Generic  Representation 

Consider  Z\X ;  >'A  ]  =  (3c, )2  +  x2Yf)(xiYf>  +  xhY*)  for  K 
=  |2,  4,  6,  8|  in  Fig.  1(b).  From  our  previous  discussions,  we 
know  that  the  information  about  the  generic  representation 
of  the  logical  interconnection  of  line  variables  of  YK  in  this  case 
is  that  there  are  two  logically-ANDed  pairs  of  two  logically 
ORed  line  variables  each.  This  information  is  unambiguously 
and  conveniently  represented  by  the  /.-expression  (Z  +  L)(L 
-F  L)  if  each  “Z.”  is  assumed  to  stand  for  a  line  variable.  More 
particularly,  given  any  set  K  of  lines  in  A,  we  can  obtain  by 
using  the  following  procedure  an  /.-expression  from  the  cor¬ 
responding  Z\X;  ) a  ]  such  that  the  resulting  /.-expression  will 
generically  represent  the  logical  interconnection  of  line  vari¬ 
ables  in  Z\X:  )*]. 

Procedure:  Given  a  Z[ X\  Yf  ]  in  the  fan-out-free  form,  j  A| 
=  l\  first  replace  each  literal  of  .V  in  Z(X;  }'*-]  by  a  “0"or  “1” 
such  that  the  simplified  expression  consists  only  of  all  the  v  line 
variables.  Then,  replace  each  line  variable  in  the  simplified 
expression  by  "L".  The  resulting  expression  is  the  required 
/.-expression,  Z,  e  L(v). 

It  is  obvious  from  this  procedure  and  the  fact  that  each  Zl-Y; 
Ka]  can  be  uniquely  written  in  an  ordered,  fan-out-free  form 
that  the  resulting  /.-expression  will  be  unique  for  a  given  set 
of  lines.  Such  an  /-expression  will  be  said  to  be  the  associated 
/-expression  with  K. 

In  the  remainder  of  this  paper  wc  will  show  that  given  a  set 
K  of  c  lines  in  any  A  and  its  associated  /-expression,  /, ,  it  is 
possible  to  determine  the  greatest  lower  bound  on  the  number 
of  fault  vectors  of  Fa  covered  by  every  SFDTS  for  A'  simply 
by  knowing  /, .  An  immediate  corollary  of  this  result  is  that 


TABLE  It 
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3 

4 

4 

10 

> 
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b 

66 

7 

180 

fc 

b2? 

9 

133? 

10 

46?4 

for  all  possible  sets  of  r  lines,  in  all  possible  internal  fan-out- 
free  networks,  with  which  the  associated  /-expression  is  the 
same,  the  greatest  lower  bound  on  the  coverage  will  also  be  the 
same.  Thus,  if  the  greatest  lower  bound  corresponding  to  each 
possible  /-expression  were  recorded  in  a  table,  then  given  any 
set  A  of  lines  in  any  A,  the  bound  may  be  obtained  simply  by 
a  table  look-up  process.  This  general  and  simple  use  of  Z- 
expressions  will  be  the  result  developed  and  validated  in  the 
following  sections  of  the  paper. 

IV.  Coverage  Table 
A.  The  Concept  of  a  Coverage  Table 

Consider  an  internal  fan-out-free  network  A,  and  a  set  K 
of  c  lines  in  A.  Let  T  be  any  set  of  input  vectors  which  detects 
all  2r  single  stuck-typc  faults  on  K.  (T,  of  course,  could  also 
detect  other  faults.)  Relative  to  A,  let  <t>(K,T)  denote  the 
number  of  fault  vectors  of  FA  also  detected  by  T.  Finally  <t>(K) 
be  the  minimum  0(A,  F)  for  all  such  T  in  A.  Since  each 
SFDTS  for  A  automatically  covers  all  2c  single  faults  on  A, 
it  follows  that  at  least  0(A)  fault  vectors  of  I  f  must  be  covered 
by  each  SFDTS  for  A. 

Suppose  now  that  the  /-expression  associated  with  K  is  /, . 
In  Section  IV-B  an  algorithm  is  presented  which,  given  an 
/-expression  such  as  Z, ,  determines  a  unique  number,  denoted 
as  0{Z, .).  In  the  next  section  we  will  prove  that  6(Lr)  is  the 
greatest  lower  bound  on  0(A)  in  the  following  sense: 

Property  PI:  Given  any  A  and  a  set  A  of  r  lines  with  which 
/,  is  the  associated  /-expression, 

#(/,)  <  0(A). 

Property  P2:  There  exists  at  least  one  A  with  a  set  A  of  c 
lines  with  which  Z,  is  the  associated  /-expression  and 


i 


»(/,-)  =  0(A). 

In  other  words,  each  SFDTS  of  A  covers  at  least  $(/,.)  fault 
vectors  of  Fa.  and  there  exists  at  least  one  A  and  an  SFDTS 
for  it  which  covers  no  more  than  0(LV)  fault  vectors  of  Fk. 
Thus,  (/(/.,  )  provides  the  greatest  lower  bound  on  the  capability 
of  any  SFDTS  to  cover  fault  vectors  of  FA.  Note  that  <?(/,.) 
is  the  greatest  lower  bound  over  all  possible  sets  of  lines  in  all 
possible  networks  u'ith  which  the  associated  /-expression  is 
/,..  Thus,  it  might  happen  in  a  general  network  that  each 
SFDTS  for  the  network  covers  more  than  #(/,  )  fault  vectors 
of  each  set  of  lines  with  the  associated  /-expression,  /, .  Re¬ 
gardless.  our  main  interest  lies  only  in  the  most  general  nature 
of  multiple  fault  coverage  capabilities  of  single  fault  detection 
test  sets. 

The  table  in  which  all  /.-expressions  and  their  corresponding 
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V 

8(L^) 

V 

* 

6(L2) 

1 

L 

2 

6 

(LLL+L) (L+L) 

56 

( (L+L) L+L) (L+L) 

56 

2 

L+L 

(LL+L+L) (L+L) 

54 

LL 

(L+L+L+L) (L+L) 

56 

3 
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e 

7 
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112 

LL+L 

( (L+L+L) L+L) (L+L) 
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(L+LIL 

((L+L) (L+L) +L) (L+L) 
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LLL 

((L+L) LL+L) (L+L) 

112 

4 

L+L+L+L 

16 

(LLLL+L) (L+L) 

112 

ll+l+l 

16 

(  (L+DL+LL)  (L+L) 

104 

16 

(LLL+LL) (L+L) 

106 

LIOa+L 

16 

( (L+L) L+L+L) (L+L) 

112 

(L+L+L) L 

16 

(LLL+L+L) (L+L) 

112 

(LL+L) L 

16 

(LL+LL+L) (L+L) 

104 

(L+L) LL 

16 

(LL+L+L+L) (L+L) 

106 

T.T.T.T. 

16 

(L+L+L+L+L) (L+L) 

112 

(L+L) (L+L) 

14 

( (L+L) L+L) (LL+L) 

106 

LL+LL 

14 

( (L+L) L+L) (L+L+L) 

104 

(LLL+L) (LL+L) 

106 

5 

(LL+L)  (L-fL) 

28 

(LLL+L) (L+L+L) 

106 

tL+L+L) (L+L) 

28 

(LL+LL) (LL+L) 

106 

6 

(LL+L) (LL+L) 

54 

(LL+LL) (L+L+L) 

102 

(LL+L) (L+L+L) 

54 

(LL+L+L) (LL+L) 

100 

(L-fL+L)  (L+L+L) 

52 

(LL+L+L) (L+L+L) 

102 

(L+L) (L+L) (L+L) 

52 

(L+L+L+L) (LL+L) 

106 

(LL+LL) (L+L) 

56 

(L+L+L+L) (L+L+L) 

106 

(LL+L) (L+L) (L+L) 

106 

(L+L+L) (L+L) (L+L) 

102 

6(L,  )’s  are  listed  will  be  referred  to  as  the  Coverage  Table. 
Table  III  shows  the  Coverage  Table  in  a  compact  form  (see 
Section  IV-C)  for  all  /.-expressions  of  size  7  and  less.  Various 
ramifications  of  this  table  are  considered  later  in  the  paper. 

B.  0(L,)  Calculation 

To  determine  0(LV),  we  give  the  following  algorithm  in 
which  the  reader’s  familiarity  with  the  cubical  complex 
notation  [21],  [22]  is  assumed. 

Coverage  Algorithm: 

Input:  Lc,  an  L-expression  consisting  of  v  L's. 

Output:  6(Lr),  the  greatest  lower  bound  on  the  minimum 
coverage. 

Comments:  This  algorithm  uses  two  sets  of  c-dimensional 
cubes  denoted,  respectively,  as  D°(Lt )  and  DHL,).  These  sets 
are  determined  by  a  recursion.  This  recursion  will  at  times 
utilize  the  /•-dimensional  unit  cube  Sr  =  ( xx  ■  x).  r  >  1. 

r 

Two  families  of  subsets  of  D°(L,)  and  D'(LV)  are  next  found, 
and  magnitudes  of  two  sets  of  representatives  (SR’s)  for  these 
families  finally  provide  the  required  0(L,).  A  set  of  repre¬ 
sentatives  of  a  family  of  sets  of  cubes,  say  |C’,  C2,  ,  Cn|  is 

defined  as  a  set  of  cubes  C  =  jci.  c2.  ,  cr|  such  that  for  each 

set  C'.  I  <  i  <  n,  there  exists  at  least  one  element,  say,  Cj  in 
C  which  also  belongs  to  C. 

Method: 

Step  I:  If  r  =  I , set  D°( L, )  =  (0)and  DHL)  =  (l).Go 
to  Step  4. 

Step  2:  if  c  >  2,  let  Lr  and  Lq.  p  +  q  =  v.  be  such  that  L , 
=  Lp  +  Lq  or  L,  =  LrLq.  Furthermore,  assume  that  D°(LP), 


D](LP).  D°(Lq),  and  D'(Lq)  are  known  by  previous  recursive 
use  of  this  algorithm. 

Step  3(a) :  If  L,  =  Lp  +  Lq.  set  D°(L ,.)  =  D°(LP)  X 
D°(Lq )  and  D'(L,.)  =  (D'(LP)  X  Sq)  u  (S,,  X  D'(Lq )). 

Step  3(b):  If  Lv  =  LpLq,  set  D°(L,)  =  (D°(LP)  X  Sq)  u 
(S„  X  D°(Lq))  and  D'(LV)  =  D'(LP)  X  D'(Lq). 

Step  4:  Let  D°(LV)  be  the  set  of  all  those  cubes  of  D°(L, ) 
each  of  which  has  a  “0”  in  the  /th  coordinate  position,  where 
/  =  1 , 2,  .  c  Similarly,  let  D)  (/.,,)  be  the  set  of  all  those  cubes 
of  DHL,)  each  of  which  has  a  1  in  the  fth  coordinate  position. 
Moreover,  let  P°(LV)  denote  an  SR  of  the  family  [D?(L, ). 
D°2(L,).  ,  />®(L,.)|  and  P'(Lr)  an  SR  of  the  family  |/>j(L,.), 

DlfLu).  .  D].(Lr)\.  Finally,  let  ||C||  denote  the  number  of 
0-cubes  in  a  set,  C,  of  cubes. 

Step 5:  Let  )  bean  SR  such  that  |  |Pm,„(L,.)] |  < 

||/>0(Z.,.)||  for  every  SR  P°(L,  ).  Similarly,  let  P'mm(L,  )  be  an 
SR  such  that  1 1 Pmin(L,' ) 1 1  <  ||P,(Z,.)||  for  every  SR 

PHUY 

Step6:6(L,)  =  | \Pl,n(L,  )|  |  +  |  )P'mM(Ll.)\). 

Stop. 

The  following  example  illustrates  the  algorithm. 

Consider  Lv  =  (L  +  L)(L  +  L),  c  =  4.  Letting  Lp  =  L  +  L 
and  Lq  =  L  +  L,  we  first  determine D°(L  +  L)  and  D’(L  +  L) 
and  then  use  these  sets  to  determine  D°(L,  )  and  /)’(/,  ).  Thus, 
from  Step  3(a) 

D°(L  +  L)  =  D°(L)  X  D°(L) 

=  (00) 

and 

DHL  +  L)  =  (DHL)  XS|)u(S,X  D'(L) 

=  (lx.xl). 

Now  using  Step  3(b)  next,  we  have 


mmmmm 


ACARWAl  AM)  MASSON  TABl.l  IOOK-IJI*  COVT.RAOI  BOl'NIAIM. 

D°(Lr)  =  (Z>°(Z.  +  L)  X  S2)  u  (S2  X  Z)°(Z.  +  L )) 

=  (OOxx.  xxOO) 
and 

D'{L,)  =  D'(L  +  L)  X  D'(L  +  Z.) 

=  (lxlx.  Ixxl,  xl  lx,  xlxl). 


2<n 

one-half  of  these  |Z-(i)|  -  2|  L(v  -  1)|  /.-expressions.  Inother 
words,  it  is  sufficient  to  list  in  the  tabic  entries  for  only 
-  |  L(v  —  1)|  of  the /.-expressions  of  size  u.  By  Lemma  2,  this 
means  we  need  to  explicitly  list  only  22  percent  of  all  the  entries 
for  L(v). 


Moving  on  to  Step  4  yields 

/>?(/.„)  =  />?(/.„)  =  (OOxx) 

Z>?(U  =  D°t(Lv)  =  (xxOO) 

/>!(/.„)  =  (lx lx,  I xx I ) 

D[(LV)  =  (xl  lx,  xlxl) 

D\(LV)  =  (lx  lx,  x  1  lx) 
and 

D\(Le)  -  (Ixxl. xlxl). 

Note  that  there  is  only  one  SR  of  the  family  |Z>®(Z.t,). 
Oft/.,,))  namely,  /*!,,„(/.„)  =  (OOxx,  xxOO).  However,  there  are 
various  SR’s  of  the  second  family.  Examining  the  possibilities 
shows  that  the  two  candidates  for  P'min(L0 )  are  ( 1  x  1  x,  x  1  x  1 ) 
and  (xl  lx,  Ixxl).  Thus,  we  finally  get  that 

#(£.  )=  II/’LMI  +  I  I Pmin( Li' ) | } 

=  7  +  7 
=  14. 

From  this  example  (and  anticipating  our  proofs  in  the  fol¬ 
lowing  section  that  8(L,  )  so  calculated  does,  indeed,  have 
Properties  PI  and  P2),  we  can  conclude  here  that  for  any  set 
of  4  lines  which  are  generically  represented  by  ( L  +  L)(L  + 
L)  in  any  internal  fan-out-free  network,  any  set  of  input  vectors 
which  detects  all  of  the  8  single  faults  on  these  lines  also  detects 
at  least  14  of  the  16  multiple  faults  involving  all  4  of  these 
lines. 

It  should  be  cited  that  the  most  difficult  aspect  of  the  im¬ 
plementation  of  the  Coverage  Algorithm  is  clearly  the  ex¬ 
haustive  step  of  determining  P°mm(U  )  and  P'm,„(Lt  ).  However, 
since  each  d(Lv)  entry  in  the  Coverage  Table  must  be  calcu¬ 
lated  only  once,  such  complexity  is  amortized  by  its  general 
use.  Moreover,  this  situation  is  alleviated  somewhat  by  the 
consideration  of  the  following  subsection. 

C.  Compacting  the  Coverage  Table 

Note  from  Table  II  that  there  are  24,  66,  and  180  L-ex- 
pressions  of  sizes,  5, 6,  and  7.  respectively.  Yet,  for  these  three 
sizes  only  2,  9,  and  24  /.-expressions,  respectively,  have  been 
listed  in  Table  III.  This  is  a  result  of  the  following  two  lemmas, 
the  proofs  of  which  are  om  tted  here  but  are  directly  based  on 
the  Coverage  Algorithm  [  1 7].1 

Lemma  3:  If  L,  =  L,—  \L  (or  L,  =  L,—  \  +  L),  then 

0(1,)  =  0(L,-,)+  2*-'. 

Lemma  4: 0(L, )  =  0(/.f).  where  Li  is  the  dual  /.-expression 
of  /.,. 

From  Lemmas  1  and  3  it  follows  that  for  each  set  Z.(t>),  it 
is  sufficient  to  calculate  8(Lr),  LrtL(v),  for  only  |Z.(o)|  — 
2|L(r  -  1)|  /.-expressions  of  sizec.  Moreover,  from  Lemma 
4  it  follows  that  d(L, )  must  actually  be  calculated  for  only 


1  For  illustrative  purposes  Lemmas  3  and  4  have  not  been  used  in  com¬ 
pacting  Table  III  for  L -expressions  of  si7e  4  and  less. 


D.  Applications  of  the  Coverage  Table 

The  most  straightforward  use  of  the  Coverage  Table  is,  as 
pointed  out  before,  to  determine  the  greatest  lower  bound, 
0(Z.,,),  on  the  number  of  fault  vectors  of  /•*  covered  by  any 
SFDTS  in  any  A  for  any  set  K  of  lines  with  which  the  associ¬ 
ated  /.-expression  is  L,,.  Thus,  for  example,  note  from  Table 
III  that  since 6(LV)  =  2*’  =  |Fx|  for  all  Ll,  v  <  3,  we  can  stale 
the  following. 

Lemma  5:  Given  any  SFDTS,  say,  Ts  for  any  internal  fan- 
out-free  network,  A,  Ts  covers  all  the  multiple  faults  of  sizes 
2  and  3  as  well. 

This,  of  course,  is  a  previously  known  result  1 10] -[  1 2]; 
however,  the  generality  of  the  Coverage  Table  approach  gives 
this  result  a  new  perspective.  To  illustrate  further,  note  from 
Table  III  that  any  SFDTS  for  jV  will  also  cover  all  the  multiple 
faults  of  size  4  except  for  the  cases  where  the  /.-expression 
associated  with  a  set  AT  of  4  lines  is  either  LL  +  LL  or  the  dual 
(Z.  +  L)(L  +  L).  Moreover,  even  in  these  two  cases,  it  can  be 
seen  from  the  table  that  at  least  14  out  of  16  fault  vectors  of 
Fk  will  necessarily  be  covered  by  each  SFDTS.  A  similar  ob¬ 
servation  about  /.-expressions  of  size  5  together  with  the  above 
observation  leads  to  the  following: 

Lemma  6:  Given  any  SFDTS,  say,  T,  for  any  internal  fan- 
out-free  A,  Tj  covers  at  least  1 4  out  of  1 6  fault  vectors  on  any 
set  of  4  lines  and  at  least  28  out  of  32  fault  vectors  on  any  set 
of  5  lines  in  A. 

Similar  observations  could,  of  course,  be  made  regarding 
multiple  faults  of  larger  sizes. 

Another  interesting  type  of  use  of  the  Coverage  Table  is  to 
perform  what  we  will  call  coverage  averaging.  This  involves 
the  use  of  the  Coverage  Table  to  determine  the  average  ca¬ 
pability  of  an  SFDTS  to  cover  all  the  multiple  faults  in  a  given 
network  of  some  predetermined  size.  To  do  this  we  make  the 
assumption  that  each  /.-expression  LeL(v)  is  equinumerous 
in  A.  We  can  then  calculate 
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AVG  [0(Z.,.)J  = 


L,.(L{l) 

2*iKtoi 


This  quantity  represents  the  average  value  of  8{L,  )  over  all 
0(Z.,.)’s  of  fixed  size  v,  divided  by  21’.  With  an  equinumerous 
assumption,  we  can  then  conclude  that  in  any  network  A,  on 
the  average  AVG[0(Z.,.)]  of  all  multiple  faults  of  size  v  will 
necessarily  be  covered  by  every  SFDTS  for  A.  A  graph  which 
shows  the  values  of  AVG|0(Z., )]  for  all  v  <  8  is  given  in  Fig. 
2.  The  following  theorem  is  a  direct  conclusion  from  this 
figure. 

Theorem  I:  On  the  average,  an  SFDTS  in  an  internal  fan- 
out-free  network  covers  92  percent  of  all  multiple  faults  of  sizes 
8  or  less. 

Clearly,  the  validity  of  this  theorem  depends  on  the  validity 
of  the  equinumerous  assumption  mentioned  above;  however. 
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Fig.:,  AVG|fl(i, ))  for<  <  8. 


consideration  of  large  general  internal  fan-out-free  A  indicates 
that  the  assumption  is  not  unduly  restrictive  and  that  for  many 
networks  of  interest  this  theorem  is  quite  applicable.  Moreover, 
if  the  equinumcrous  assumption  were  not  valid  for  a  network, 
a  weighted  coverage  averaging  could  be  performed. 

Now,  this  theorem  together  with  some  recent  results  by 
Goldstein  (16]  represent  a  quantitative  affirmation  of  the 
common  speculation  that,  at  least  in  internal  fan-out-free 
networks,  SFDTS’s  cover  most  multiple  faults  as  well.  More 
specifically,  Goldstein  [  1 6]  has  shown  by  exploiting  statistical 
information  relating  to  physical  defects  and  chip  layout  data 
that  for  most  networks  consideration  of  multiple  faults  up  to 
size  8  is  sufficient  for  multiple  fault  analysis.  Of  course,  for 
networks  where  the  consideration  of  at  most  8  faults  is  not 
sufficient.  Theorem  I  could  be  accordingly  extended. 

A  very  useful  result  obtained  by  coverage  averaging  under 
the  equinumerous  assumption  for  la.ge  networks  is  the  fol¬ 
lowing  theorem. 

Theorem  2:  For  all  v  >  I,  AVG[0<£, )]  >  0.461 

A  proof  of  this  theorem  using  the  asymptotic  nature  of  0(Lt) 
is  derived  in  [17].  The  implication  of  the  theorem  however  is 
that  on  the  average  at  least  46. 1  percent  of  all  multiple  faults 
are  covered  by  each  SFDTS  in  an  internal  fan-out-free  net¬ 
work.  This  is  the  first  quantitative  result  which  provides  an 
estimate  of  the  capability  of  SFDTS’s  to  cover  all  multiple 
faults  in  networks  which  are  more  general  than  restricted 
fan-out-free  networks  [13],  the  largest  class  of  networks  for 
which  it  is  known  that  every  SFDTS  covers  all  multiple 
faults. 

V.  Justification  of  0(L, ) 

In  this  section  Properties  PI  and  P2  of  #(£,-)  which  were 
described  in  the  previous  section  and  which  justify  our  asso¬ 
ciated  claims  regarding  6(LV)  will  be  considered.  PI  will  be 
proved  in  Section  V-A  and  a  procedure  to  construct  a  network 
for  any  Lv  such  that  P2  holds  will  be  given  in  Section  V-B.  In 
the  following,  our  arguments  for  convenience  and  with  no  loss 
of  generality  will  be  developed  under  the  assumptions  that  the 
set  K  of  lines  being  considered  is  such  that  each  Y,(  YK  appears 
in  Z\X\  K*]  in  the  uncomplemented  form  only,  and  that  the 
set  K  =  |1. 2,  •  ,i>|. 

We  start  with  the  following  two  definitions  which  were  in¬ 
troduced  in  [10]. 

Definition  3:  Given  a  set  K  of  lines  in  N  and  an  input  vector 
Xj  to  TV,  a  fault  vector  a*tFA  will  be  said  to  be  a  (0  -*•  /)- 
tested  fault  vector  for  X,  if  Z[X T*]  =  0  and  Z(A,;  aK  ]  = 
I.  Similarly,  a  fault  vector  a* (F*  will  be  said  to  be  a  (1  — 


0)-tcstcd  fault  vector  for  A",  if  Z)X,:  YK]  =  1  and  7.\X,\  a*-] 
=  0. 

Definition  4:  Given  an  input  vector  X,  to  A',  a  fault  vector 
«a<Fa  will  be  said  to  be  a  /3-consistcnt  fault  vector  for  X, 
relative2  to^A";  YK\  if -4!*,;  K*]  =  A\X,\ «A]  =  /9.  /?< [0, 
1|. 

Clearly,  a  ^-consistent  fault  vector  is  such  that  the  Boolean 
value  of  A  [A-,  YA\  for  X,  is  not  changed  by  the  presence  of  a*. 
For  example,  if  A\X:  Yk)  =  X\  Y\  +  X2K2  for  K  =  [1, 2],  and 
Ti  =  *3  and  Y 2  =  x4,  then  =  (0,  1)  isO-consistent  for  A", 
=  (1001 ).  Similarly,  the  fault  (0,  0)  is  also  seen  to  be  0- con¬ 
sistent  for  A j. 

It  is  shown  in  [10]  that  the  concepts  of  Definitions  3  and  4 
are  of  fundamental  importance  in  determining  the  exact 
coverage  of  fault  vectors  of  F \  by  input  vectors  which  are 
known  to  cover  fault  vectors  of  F;  and  Fj,  where  /  and  J  form 
a  partition  of  K.  In  this  paper,  we  use  the  same  concepts  instead 
to  obtain  the  greatest  lower  bound  on  the  exact  coverage.  The 
main  difference  between  the  two  approaches  is  that  whereas 
in  [10]  the  complete  functional  information  contained  in  Z[A'; 
Yk)  is  used  to  calculate  the  exact  coverage,  we  use  in  this  paper 
only  the  information  contained  in  the  associated  /.-expression 
to  calculate  the  bound.  To  illustrate  the  difference,  consider 
a  set  of  K  of  lines  such  that  Z\X,  VA]  =  H[X\  Y,  ]  +  M[X\  Yj\ 
+  0[X].  Let  a/tFi  be  a  (1  — ►  0)-tested  fault  vector  for  some 
A /.  In  other  words,  Z|Aj;  K/]  =  I  and  Z[X,\  £*/]  =  0.  Since 
Z[X\  Y /]  is  the  Boolean  function  realized  by  A.  it  is  the  same 
as Z[A';  >'*]  =  H[X:  Y,\  +  M\X ;  Yj)  +  0)X]  ifall  Y,tYj  in 
M[X\  Yj\  are  replaced  by  their  equivalent  primary  input 
Boolean  functions.  Thus,  it  follows  that 

H[Xr.  Y,\  +  M[Xj\  Yj]  +  0[Xj]  =  1 

and 

H\X,\  «,]  +  M[Xr.  Yj)  +  0{X,]  =  0. 

That  is,  H[Xr.  Y/)  =  1.  H[X,\  a,)  =  0,  M\X,\  Yj )  =  0,  and 
0)Xj]  =  0. 

Next,  let  ( xjiFj  be  a  0-consistent  fault  vector  for  A relative 
to  M[Xi\  Yj):  that  is,  M[X,\  Yj]  =  M[X,\ a  j)  =  0.  It  is  then 
easily  seen  that  «A  =  ( «/ .  07)  is  also  a  (1  — *■  0)- 
tested  fault  vector  for  A',.  In  other  words,  we  have  that  if  a/  is 
a  ( 1  —  0)-tcsted  fault  vector  for  A',,  aj  is  any  0-consistent  fault 
vector  for  AT,,  and  Z[X;  YA  )  =  H[X\  Y,)  +  M\X\  Yj)  +  0[X), 
then  a*  =  (a/,  aj)  is  also  a  (1  — *■  0)-tested  fault  vector  for  A",. 
More  specifically,  if  we  define 

Ci0(*,)  =  I«a|Z[A',;  Yk)  =  1  and  Z[Af,;  «*■]  =  0| 

and 

hj(X,)  =  \aj\M[Xr,  Yj]  =  At[Xr.  <*j}  =  0], 
then  for  the  above  example  we  have  that 

f'x>(A',)  =  G}°(Aj)  X  F*j{Xj).  (5.1) 

By  recursively  using  relations  such  as  (5.1)  above,  it  is 

3  A\X;  >  A  1  is  used  to  denote  a  general  Boolean  expression  in  the  fan-out-free 
form  in  terms  of  the  variables  of  A'  and  )\.  In  particular.  A  (A;  >Y  I  could  be 
the  same  as  Z|AV  K*  J. 
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possible  [10]  lo  determine  the  exact  coverage  of  FK  by  an 
SFDTS.  This  approach,  however,  is  not  very  practical  because 
sets  such  as  F5(Af, )  are  highly  dependent  on  the  particular  X„ 
J.  and  M[X\  Yj ]  under  consideration. 

It  turns  out  though,  as  will  be  shown  in  the  following,  that 
it  is  possible  todetermine  a  subset  of  F$(. V,-)  simply  by  knowing 
the  /.-expression  associated  with  J.  That  is,  the  generic  in¬ 
formation  of  logical  interconnection  of  line  variables  of  Yj  in 
M\X:  Yj]  can  be  suitably  exploited  to  obtain  a  subset  of 
Fj(X,).  Since  such  generic  information,  namely  the  associated 
/.-expression  with  J.  is  independent  of  the  particular  A,,  J,  and 
M\X:  Yj]  under  consideration,  such  subsets  can  clearly  be 
used  in  infinitely  many  different  situations. 

Furthermore,  it  is  also  possible  to  show  that  a  subset  of 
Gj°(A', )  can  be  obtained  simply  by  know  ing  the  /.-expression 
associated  with  /.  Using  these  two  results,  it  then  follows  from 
(5.1)  that  a  subset  of  can  be  obtained  by  using  the 

/.-expression  associated  with  K.  In  fact,  the  Coverage  Algo¬ 
rithm  of  the  previous  sectioji  does  exactly  this,  as  we  shall  prove 
in  the  following  two  lemmas.  It  should  now  be  clear,  however, 
that  the  greatest  lower  bound  on  the  exact  coverage  of  F *•  is 
directly  related  to  such  subsets  of  Gj'(X)  and  G*®(.Y,).  X,(  T,. 
We  will  consider  more  details  of  this  in  Sections  V-A  and 
V-B 

Before  presenting  the  two  lemmas,  it  will  be  convenient  lo 
introduce  some  notation  here.  Because  of  the  generic  approach 
we  are  taking,  we  will  represent  fault  vectors  and  fault  com¬ 
plexes  in  the  following  by  using  the  same  cubical  complex 
notation  as  is  used  in  the  Coverage  Algorithm.  In  this  notation, 
a  fault  vector,  say,  aA  will  be  considered  as  a  /'-dimensional 
O-cube.  The  fault  complex  FA  will  then  simply  be  the  i  -di¬ 
mensional  unit  cube  (xx  x).  Moreover,  a  subset  of  F*  is 
then  represented  as  a  set  of  various  cubes  in  the  /  -dimensional 
unit  space.  The  use  of  these  notations  will  be  clearer  as  we 
proceed. 

Lemma  7:  Let  K  -  |1, 2,  ■  •  ,  r|  be  any  set  of  v  lines  in  N  and 
let  A[X:  Yk]  be  a  fan-out-free  form  Boolean  expression. 
Moreover,  let  L,  be  the  /.-expression  associated  with  K  relative 
to  A[X;  Yu],  Then  given  any  A'„  A  (A",;  FA  J  =  0,  there  exists 
at  least  one  cube,  say,  ciD8(L, )  such  that  the  set  of  O-cubes 
of  c  is  contained  in  or  equal  to  F$(X,).  /3r|0,  I). 

Before  we  enter  into  a  formal  proof,  the  scope  of  this  lemma 
should  be  emphasized  again.  Recall  that  in  the  Coverage  Al¬ 
gorithm,  the  set  of  cubes  DB{Lr)  is  determined  with  no  refer¬ 
ence  to  a  particular  network  /V,  or  a  particular  set  K  of  lines, 
or  a  particular  input  vector  A Indeed,  we  only  considered  a 
set  of  r  lines  in  any  A’  whose  logical  interconnection  was  gen¬ 
erally  represented  by  L,  .  Yet,  in  spite  of  the  generality  under 
which  Ds(Lt  )  is  generated,  we  nevertheless  are  stating  in 
Lemma  7  that  this  set  of  cubes  contains  a  subset  of  FBK(Xj) 
where,  by  definition,  in  this  latter  set  we  are  referring  to  par¬ 
ticular  /V,  K,  and  X , — and  this  result  holds  regardless  of  the 
particulars.  That  is,  it  holds  for  each  of  the  infinite  number  of 
cases  in  which  L ,  can  be  associated  with  some  set  of  lines  in 
an  internal  fan-out-free  network.  Hence,  Lemma  7  makes  a 
fundamental  connection  between  the  generic  and  the  specific 
which  is  at  the  very  center  of  all  our  results. 

Proof:  Let  v  =  1  -  Then  K  =  1 1 1  and  A  [X:  F*]  =  A ,  [X]  F, 


+  .d;|A')  for  some  A  i|A')  and  A 2IA'].  Given  now  that  .4 1 A , ; 
FA]  =  A i|A',]>'i  +  A2\X,]  =  0  (or  1)  for  some  A',,  it  imme¬ 
diately  follows  that  Ai[X,]ot)  +  A2\X,]  =  0  (1 )  for  rr ,  =0(1). 
That  is.  Fa  (A-,)  3  \0\.  Recalling  now  from  the  Coverage  Al¬ 
gorithm  that  DB(L, )  =  (0)  for  c  =  1,  we  have  that  Lemma  7 
is  true  for  v  =  I . 

Assume  now  that  the  lemma  is  true  for  every  set  K  of  1 , 2, 

•  •,  and  v  —1  lines,  where  v  >  2.  Based  on  this  assumption,  we 
will  now  prove  that  it  is  also  true  for  every  set  K  of  v  lines. 
Thus,  let  K  be  a  set  of  r  lines  and  Lv  be  the  /.-expression  as¬ 
sociated  with  K  relative  to  A[X;  FA].  Let  l  and  J  form  a 
partition  of  A  such  that  ^4  (A';  FA]  =  (A\[X;  Y/]*A2  [X:  >0)) 
+  0[X\.  Then,  clearly,  L ,  =  Lp  *  Lq,  where  Lp(Lq)  is  the 
/.-expression  associated  with  1(J)  relative  to  A,[X-,  Y/]{A2\X: 
Yj]).  Wc  will  prove  the  lemma  only  for  the  case  where  “»"  is 
“+"  and  0  =  0.  The  proof  for  the  remaining  cases  follows 
analogously. 

Thus,  let  X,  be  such  that 

A |A,;  Fa  )  =  A0X,-  Y, ]  +  A2\Xr.  Yj]  +  0\XA  =  0 
Then  clearly, 

A\[Xr*  F/]  =  AAXr,  Yj]  =  0[Xt]  =  0.  (5.2) 

Now,  let  aj  be  a  0-consistent  fault  vector  for  X,  relative  to 
As\X\  Y/ ):  that  is,  let  «/eF?(.Y,).  Similarly,  let  otjtF'jf X,). 
Then  by  Definition  4, 

A  |  [A,:  «/]  =  Oand  A2\X,:  «,]  =  0.  (5.3) 

Using  (5.2)  and  (5.3).  it  is  then  easily  seen  that  if  a*  =  (a/, 
(xj).  then  since 

A[X,:  «a]  =  -4,|A'(;  «/]  +  A2[Xi\  ay)  +  0[Xt]  =  0. 

Therefore  «A  must  be  a  0-consistcnt  fault  vector  for  A',  relative 
to  A  [A,;  Fa  ].  In  other  words,  we  have  proven  that  if  a/fF?(A',) 
and  ajtF°j(X,).  then  «a'CFa(A,).  That  is, 

F°(Xj)  X  Fj(Xj)  s  Fj(A,).  (5.4) 

Consider  now  the  sets  D°(LP )  and  D°(Lq).  By  our  induction 
hypothesis  we  have  that  there  must  exist  at  least  one  cube,  say, 
c\(D°(Lp)  and  another  cube,  say  c2(D°(Lq)  such  that  the  0- 
cubes  of  C|  and  c2  are  contained  in  F®(A,)  and  Fj{Xj),  re¬ 
spectively.  It  then  follows  from  (5.4)  that  the  0-cubes  of  the 
cube  (c|f2)  will  belong  to  Fa(A,).  However,  from  Step  3(a) 
of  the  Coverage  Algorithm  it  is  seen  that  (r  ,c2)tD°(Lt  ). 

Q.E.D. 

Lemma  8:  Let  K  =  ]  1 , 2,  ,  c|  be  any  set  of  v  lines  in  N  and 

let  Lv  be  the  /.-expression  associated  with  K.  Moreover,  let  A', 
be  an  input  vector  which  (0  — *  I  )-lesls  (or  (1  — *  0)-tests)  the 
single  fault  a,  =  1  (correspondingly,  a,  =  0),  tf  K.  on  the  lines 
of  K.  Then  there  exists  at  least  one  cube,  say, 
«/>,'(/,, )(«/>?(/.,.))  such  that  the  set  of  0-cubes  of  r  is  con¬ 
tained  in  or  equal  to  Ga'(A’,)((7a®(A',)). 

Lemma  8,  like  Lemma  7,  makes  a  fundamental  connection 
between  the  generic,  D?(Lr),  /3<|0, 1  j,  and  the  specific,  G®'(A',-) 
and  C;°(A',). 

Proof:  First  of  all  note  that  because  of  our  assumption 
that  each  F,€Fa  appears  in  Z[X ;  FA  ]  in  the  uncomplemented 
form,  it  follows  that  the  s-a-0  fault  on  any  single  line  u K  (that 
is,  the  fault  ex,  =  0)  will  always  be  ( 1  — *•  0)-tested  by  any  A', 
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which  tests  this  fault.  Similarly,  the  fault  a,  =  I  will  always 
be  (0  -*  1  )-tested  by  each  A',  which  tests  it.  In  other  words, 
G|'®(A',)  =  (0)  and  G®j(A',)  =  (1)  for  every  ttK. 

T o  prove  the  lemma  now,  let  K  =  1 1 )  and  v  =  1 .  Then  since 
/)!(£,)  =  ( 1 )  and  /)“(/*,)  =  (0)  from  the  Coverage  Algorithm, 
and  since  G*  (A/)  =  (1 )  and  Gi°(Af,)  =  (0)  as  discussed  above, 
the  lemma  is  clearly  true  for  v  =  1 . 

Assume  now  that  the  lemma  is  true  for  all  sets  of  1 , 2,  , 

v  —  1  lines.  Consider  then  a  set  K  =  1 1,2,  ,n|  oft)  lines.  Let 
/  and  J  form  a  partition  of  K.  where  /  =  |1, 2,  ,p|and  J  = 
|p  +  1 ,  p  +  2,  •  •,  P‘  +  ?},  p  +  Q  =  v.  Let  L,  =  Lp  *  Lq  be  the 
/.-expression  associated  with  K  such  that  LP[LW)  is  the  L- 
expression  associated  with  l(J).  We  will  now  prove  the  lemma 
for  the  case  wherein 

Z[X-  Yk]  =  H[X\  Y, ]  +  M[X-  Yj }  +  0\X).  (5.6) 

Xj  ( 1  -*  0)-tests  a,  =  0,  and  id.  The  proof  for  the  remaining 
cases  follows  in  a  similar  manner.  Recall  here  that  for  the  case 
mentioned  above  we  have  already  seen  in  (5.1 )  that 

GjftAf,)  =  Gj«(Af,)  X  FS(*,).  (5.7) 

Moreover,  since  X,  ( 1  — »  0)-tesls  a,,  td.  it  follows  from  our 
induction  hypothesis  that  there  exists  a  cube,  say,  c '«/)?(/.,,) 
such  that  all  the  0-cubes  of  c1  are  contained  in  G}°(X,).  Also, 
from  Lemma  7  we  have  that  there  exists  a  cube,  say,  c2tD°(Lv) 
such  that  all  the  0-cubes  of  c 2  are  contained  in  Fj(Xj).  Thus 
it  follows  from  (5.7)  that  all  the  0-cubes  of  the  cube  c  =  (c  V2) 
will  be  contained  in  Gj^A-,).  However,  from  the  Coverage 
Algorithm  we  know  that  ctD°(Lv)  since  D°(L ,  )  =  D°(LP)  X 
/>«(/.,).  Q  E  D. 

A.  Proof  of  Property  PI 

Consider  an  SFDTS,  T„  for  A  and  let  K  be  any  set  of  v  lines 
in  N.  The  exact  capability  of  Ts  to  cover  fault  vectors  of  /> 
can  be  quantitatively  expressed  as 

G*(7,)=  U  G«(X.) 

Xt,T, 

where  GK(Xj)  is  Gj'fA",)  or  Gj(0(A',  )  depending  on  Xj.  Clearly, 
|G*(7'J)|  =  <t>{K.Ts),  as  defined  in  Section  IV.  Using  Lemmas 
7  and  8,  we  now  show  in  this  section  that  certain  subsets  of 
Gx(X,  ),  X,(T„  when  combined  in  the  specific  manner  detailed 
in  the  Coverage  Algorithm  lead  to  a  number,  6(Lr).  which 
satisfies  PI. 

Theorem  3:  Given  a  set  A  “  j  1, 2,  •••,»)  of  n  lines  in  A  and 
the  minimum  coverage  <t>{K)  of  F*.  by  every  SFDTS  for  A,  the 
integer  6(L,  )  obtained  by  the  Coverage  Algorithm  is  such 
that 

0(LV  )  <  4>(K). 

where  Lv  is  the  /.-expression  associated  with  K. 

Proof:  Let  T,  be  an  SFDTS  of  A  such  that 

MK)  =  4>(K,TS)  =  \Gk(T,)\.  (5.8) 

Since  there  are  2d  single  faults  on  the  lines  of  K ,  there  must 
exist  at  least  one  input  vector  in  T,  corresponding  to  each  of 
these  2d  faults.  To  be  more  specific,  let  X*lxtTs.  f  =  1,2,  , 

v,  be  an  input  vector  which  (0  — »  1  )-tests  the  single  fault  a,  = 
1;  and  let  X',°(T3.  t  -  1, 2,  ,  r  be  an  input  vector  which  (1 
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-*•  0)-tests  the  single  fault  a,  -  0,  where,  of  course,  all  these 
vectors  might  not  be  distinct  vectors.  Recalling  that 

(U  MX,)  =  Gk(T ,) 

for  aft 
X,tTl 

it  is  then  clear  that 

,U  GM1)  U  G'M0)  s  G/rfT,).  (5.9) 

for  all  for  all 

it  A  it  A 

Consider  now  Lemma  8.  For  each  u K.  let  c?1  be  a  cube  in 
D}{LV).  all  the  0-cubes  of  which  are  contained  in  G^fA?1). 
Similarly,  let  r,10.  ttK.  be  a  cube  in  D?(/.„),  all  the  0-cubes  of 
which  are  contained  in  Gi^A',10).  Then,  it  follows  from  this 
lemma  that 

{0-cubes  in  |c?\  cf.  .  c®1))  s  U  Gj,(^").  (5.10) 

for  all 
uK 

and 

{0-cubes  in  \c)°.  c'2°.  .  cj° ||  s  U  G'k0(X!°). 

for  all 
it  A 

(5.11) 

Clearly,  now,  {c®1,  c®1.  ,  c,0l|  is  an  SR,  say,  F*01  (i*,.)  of  the 
family  \D\{LV).  /){(/»).  ,  /),!(/.,)).  and  |r!VJ0,  •  •,c/,®|  is 
an  SR.  say,  P'°(LV)  of  the  family  |D?(/-„),  />?(/.„).  , 

D®(Lt,)).  Therefore,  we  have  from  (5.10)  and  (5.1 1)  that 

||/>0,(L„)||  <  |  UG£,(Af®,)|  (5.12) 

for  all 
uK 

and 

||/»°U1,)||  <  |  U11C|c°(A’f,0)|.  (5.13) 

for  all 
it  A 

In  other  words,  using  (5.12)  aid  (5. 13)  in  (5.9),  we  get  that 
||/>0l(Z.l,)||  +  ||/»,0(£r)||  <  |Cic(7'J)|.  (5.14) 

Finally,  recalling  from  the  Coverage  Algorithm  that 
I  |f>min(^-i')|  |  +  ||I*|S.(W||. 

where  | |/^jn(/-t,)| |  <  ||/>0,(i,)||  and  ||PL®n(/.,)||  < 
|{/>,0(/.r)||  and  using  (5.8)  and  (5.14),  it  is  clear  that 

0(L, )  <  0-E.D. 

B.  Property  P2 

Given  any  /.-expression  L,  we  will  in  the  following  first 
present  a  procedure  which  will  associate  with  L,  a  specific 
network,  denoted  as.  A(/,,  ).  A  set  of  lines  will  next  be  shown 
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to  exist  in  JV(Z., )  such  that  the  /.-expression  associated  with 
K  is  /.,..  FinalJy,  an  SFDTS.  Ts,  for  NIL,)  will  be  generated 
with  the  property  that  <t>(K.Ts )  =  4>(K)  =  (i(Lv).  For  the  sake 
of  brevity,  no  proofs  are  provided  in  this  section;  the  interested 
reader  can  request  (17]  for  all  the  pertinent  details. 

Let  L,  be  an  /.-expression  consisting  of  v  L's.  Then  a  net¬ 
work  of  class  Cl,  denoted  as,  N{LV)  which  has  no  primary 
input  fan  out  and  consists  of  3o  primary  input  variables  will 
be  associated  with  L,.  by  the  procedure  given  in  the  following. 
The  network  N(Lp)  will  be  constructed  by  this  procedure  in 
a  recursive  manner  by  using  two  networks,  say,  N(LP)  and 
N(Lq),  where  Lp  *  Lq  =  /.,,.  The  3o  primary  input  variables 
of  N(U)  will  simply  be  xi.  X2.  .  x3l,.  Similarly,  the  3 p  (3 q) 

primary  input  variables  of  N(Lp)  (N(Lq))  will  be  at i .  xi, 
xip  (correspondingly,  X|.  x2.  .  xj,).  However,  when  a  net¬ 

work,  say,  N(Lq)  is  used  as  a  subnetwork,  as  in  the  construc¬ 
tion  of  yV(Li:).  then  the  3 q  primary  input  variables  of  the 
subnetwork  N(Lq)  will  be  some  3 q  consecutive  x/s,  such  as, 
X$p+\,  X$p+2'  .  X}p  +  lq ■ 

The  Procedure  can  now  be  formally  stated  as  follows. 

Network  Construction  Procedure- 

Step  I:  If  v  =  1,  then  N(L,)  is  as  shown  in  Fig.  3(a). 
Stop. 

Step  2:  If  v  >  2,  let  v  =  p  +  q  such  that  Lv  =  Lp  *  Lq.  Fur¬ 
thermore,  assume  that  N(Lr)  and  N(Lq)  are  known  from 
previous  use  of  this  procedure. 

Step  3:  Let  the  primary  input  variables  of  N(LP)  be  x\,  x2, 

,  x2p.  and  that  of  N(Lq)  be  x3<,+  i,  x3/,+  2.  .  xip+2q. 

Step  4:  If  L,  ~  Ip  +  Lq(LpLq).  then  form  N(LV)  by  ORing 
(ANDing)  N(LP )  and  N(Lq)  as  shown  in  Fig.  3(b)  (see  Fig. 
3(c)).  Stop. 

In  the  following  we  will  assume  that  an  input  line  which  is 
connected  to  the  primary  input  variable  x2.  I  <  j  <  3t\  is 
labeled  with  integer  j.  Note  now  that  each  N(L,)  consists  of 
exactly  v  subnetworks  of  the  type  shown  in  Fig.  3(a).  More¬ 
over,  these  v  subnetworks  are  logically  interconnected  in  /V( L, ) 
in  the  same  manner  as  the  c  "L"  symbols  in  /*,.  Thus,  it  implies 
that  if  K  is  a  set  of  r  lines  in  N{L, )  such  that  there  is  exactly 
one  line  in  K  from  each  of  these  r  subnetworks,  the  /.-expres¬ 
sion  associated  with  the  set  K  would  be  Lr.  We  will  thus  be 
concerned  in  the  following  with  the  specific  set  K  =  |2,  5,  , 

3e  —  Ijofr  lines  in  N(Lt  ). 

We  now  present  a  test  set  generation  procedure  which  will 
generate  a  SFDTS,  Ts,  for  N(L, .)  such  that  QfK.T,)  =  Q(L,.). 

T,  will  consist  of  4r  input  vectors,  namely,  T,  =  \X^' .  Af?1,  X'2. 
vJO  yoi  yoi  yio  yio  yoi  yoi  yio  yio  yOi 

Ay  ,A  4  tA  5  ,A$  ,A{,  .  »  ^  3A-2»  A  3A- I*  *  3A- 1*  *  3**  •  ^  3« — 2* 

Xjr-i.  Xir-i.  A"3?|  where  each  A?' (A1/0)  is  an  input  vector 
which  (0  -»  l)-tests  ((1  -*  0)-tests)  the  single  line  i-s-a-l  (line 
/-s-a-0)  in  N(Lv).  It  will  be  assumed  that  the  sets  P „*„(/.,,)  and 
/*£,,„(/., ,)  from  which  6(LV)  was  calculated  are  given. 

Test  Set  Generation  Procedure: 

Do  Steps  1 ,  2,  and  3  for  /  =  1 ,  2,  •  •  ,  v. 

Step  I:  Let  c'(Pamin(L,  ).  /3({0,  IJ,  be  the  cube  which  repre¬ 
sents  the  set  /)?(/., )  in  /!,„(/, ).  Moreover,  assume  that  c'  = 

ic',c'2  c1,). 

Step  2:  For  a  given  Q,  do  the  following  for  /  =  1,2,  .  r.  The 
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Fig.  3.  Networks  for  the  network  construction  procedure. 

resulting  output  will  be  A^jJ-i  if  0  =  1,  and  A'i?-,  if  0  =  0. 

a)  If  c\  =  /3.  t  ^  i,  then  let  X3/-2  =  1,  X3/-1  =  0  and  x3, 

=  0; 

b)  If  c]  =  0,  t  =  1,  then  let  x3,_2  =  I,  X3/-1  =  0  and  x3) 

=  0; 

c)  If  c]  -  x.  then  let  X3/-2  =  X3,_|  =  X3,  =  0. 

Step  3:  For  a  given  /?,  do  the  following  for  /  =  1,2,  ,  v.  The 

resulting  output  will  be  X\^2  if  0  =  1 ,  and  A"3?  if  /?  =  0. 

a)  Same  as  Step  2a); 

b)  If  c'i  =  0,  t  =  i,  then  let  xv-2  =  0,  x3l_  1  =  P.  and  x3l 

=  0; 

c)  Same  as  Step  2c). 

Stop. 

The  test  set,  Ts.  obtained  by  the  above  procedure  is  an 
SFDTS  for  N(Lr).  and  is  such  that  <t>{K,Ts)  =  0(L„)  (17).  The 
following  example  illustrates  the  above  developments. 

Example:  Let  L,  =  (L  +  L){L  +  L).  The  network  N(L,  ) 
obtained  by  using  the  Network  Construction  Procedure  is 
shown  in  Fig.  4.  Let  K  =  )2,  5, 8,  11)  be  the  set  of  4  lines  of  in¬ 
terest.  Recall  from  the  Example  given  in  Section  IV  that  for 
L,  =  (L  +  L)(L  +  L ).  we  found  that  6(Lt  )  =  14,  and 

P  rninf^r  )  *  (Ixlx.xlxl) 

and 

FL, {Lt.)  =  (00xx,  xxOO). 

Completely  generating  T,  with  the  Test  Set  Generation  Pro¬ 
cedure  will  result  in  the  specification  of  Ac  =  16  input  vectors. 
We  will,  however,  illustrate  the  use  of  the  Test  Set  Generation 
Procedure  only  for  the  generation  of  one  input  vector.  The 
complete  Ts  can  be  generated  similarly  and  is  listed  in  (17). 

Thus,  consider  the  single  fault,  say,  ag  =  0  (that  is,  the  fault 
line  8-s-a-0).  Then  to  determine  Ag°  we  use  Step  2  of  the  pro¬ 
cedure  for  /  =  3  and  0  =  0.  Doing  this,  we  note  that  the  cube 
c',  t  =  3,  which  represent  D?(LV)  in  PmmfL,,)  is  (xxOO)  = 
(c fcjf  ici).  Therefore,  since  c\  =  x,  for  1  =  1 ,  we  have  from  Step 
2c) that 
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Fig.  4.  Network  for  the  example  in  Section  V-B. 


at i  =  1 ,  *2  =  1,  and  *3  =  1 . 

Similarly,  since  c\  =  x  for  /  =  2,  we  have 

*4  =  1 ,  *5  =  1 ,  and  JCf,  =  1 . 

For  /  =  3,  c'  -  0  and  i  =  t;  thus,  by  Step  2(b)  we  have 
*7  =  1,  x8  =  1,  and  xi)  =  0. 

Finally,  for  i  =  4,  we  get  from  Step  2(a)  that 
A  m  =  l,Xn  =0,  and  jc 1 2  =  0. 

That  is, 

A'i°  =  (1,  1,  I,  I,  1,  1,  1,  1.0,  1,0,0). 

The  procedure  can  similarly  be  applied  to  generate  the  re¬ 
maining  1 5  input  vectors. 

It  is  important  to  point  out  here  that  although  the  networks 
constructed  in  this  subsection  are  fan-out-free  networks,  it  is 
also  possible  to  construct  networks  which  have  primary  input 
fan  out  and  for  which  <?(/,.)  =  However,  because  the 

Network  Construction  Procedure  and  the  Test  Set  Generation 
Procedure  for  such  networks  are  much  more  complex,  we  have 
for  simplicity  considered  fan-out-free  networks  to  justify 
Property  P2. 

In  conclusion,  it  has  been  shown  in  this  section  that  given 
any  set  of  K  of  r  lines  in  any  internal  fan-out-free  network  A', 
the  greatest  lower  bound  on  the  minimum  coverage  <MK)  is 
simply  the  integer  0(Z, ),  where  Z,  is  the  /-expression  asso¬ 
ciated  with  K  and  0(Z, )  is  obtained  by  using  the  Coverage 
Algorithm. 

VI.  Conclusion 

This  paper  develops  a  new  theoretical  approach  to  deter¬ 
mining,  quantitative  bounds  on  the  multiple  fault  coverage 
capability  of  single  fault  detection  lest  sets  in  combinational 
internal  fan-out-free  networks.  The  importance  of  such  bounds 
is  underscored  by  the  development  of  VLSI  technology  where 
it  is  becoming  increasingly  evident  that  explicit  multiple  fault 
coverage  considerations  are  prohibitive.  The  main  feature  of 
our  approach  is  the  generic  representation  of  faults  in  terms 
of  /.-expressions.  Using  this  representation,  it  is  possible  to 
determine  the  greatest  lower  bound  on  the  number  of  multiple 
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faults  covered  by  any  SFDTS  on  any  set  of  lines  in  any  internal 
fan-out-free  network  by  means  of  a  reference  into  a  precal¬ 
culated  Coverage  Table.  The  generality  of  our  results  can  be 
seen  by  noting  that  various  previously  known  results  of  this 
type  simply  correspond  to  specific  entries  in  the  Coverage 
Table.  Moreover,  an  averaging  over  all  the  entries  of  the 
Coverage  Table  yields  general,  quantitative  observations,  such 
as  those  stated  in  Theorems  I  and  2. 

Although  the  discussions  in  this  paper  were  limited  to  irre- 
dundant  networks,  the  concept  of  /.-expressions  is  equally 
applicable  to  redundant  networks.  Some  preliminary  work 
along  these  lines  has  been  reported  elsewhere  [30].  Further¬ 
more,  there  are  various  other  interesting  problems  to  which 
the  application  of  the  /.-expression  concept  would  be  appro¬ 
priate: 

1 )  Since  it  can  clearly  be  observed  from  the  Coverage  Table 
that  there  are  certain  inherently  “bad”  /.-expressions  in  the 
sense  that  such  /-expressions  indicate  network  structures 
which  can  have  multiple  faults  that  are  poorly  covered  by 
SFDTS’s,  it  would  be  useful  to  develop  design  principles  which 
minimize  the  occurrence  of  such  /-expressions  in  combina¬ 
tional  networks. 

2)  The  averaging  scheme  used  in  this  paper  is  based  on  a 
heuristic  assumption  of  the  “equinumerous  occurrence”  of 
/-expression.  It  is  clear,  however,  that  more  net  work -specific 
averaging  schemes  would  lead  to  tighter  bounds. 

3)  The  extension  of  these  results  to  general  reconvergent 
fanout  network  of  Class  C3  would  be  significant.  A  possible 
approach  to  such  an  extension  might  be  developed  by  intro¬ 
ducing  the  concept  of  a  labeled  /-expression,  in  which  all  Z’s 
with  the  same  label  would  correspond  to  a  stem  line  and  its 
fan-out  branches.  Similar  to  the  approach  taken  in  this  paper, 
this  labeled  /-expression  approach  also  would  have  the  ad¬ 
vantage  of  utilizing  the  well  developed  theory  of  labeled  two- 
terminal  series-parallel  networks  |20]  to  enumerate  and  specify 
various  sets  of  labeled  /-expressions; 

4)  The  generic  representation  concept  would  be  particularly 
interesting  to  explore  under  the  constraints  of  multiple  fault 
testing  problems  in  highly  structured  devices  such  as  PLA’s 
and  ROM’s  [29], 
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This  overview  presents  several  different  types  of  circuit  switching 
networks:  concentrators ,  connectors ,  expanders ,  partitioned, 
SIMD  interconnectors,  and  sorters. 
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Circuit  switching  networks  are  systems  which 
provide  a  set  of  interconnecting  circuits  from  a  set  of 
inputs  to  a  set  of  outputs  by  opening  and  closing 
switches,  or  crosspoints.  As  a  discipline,  circuit 
switching  networks  deal  fundamentally  with  the 
design  and  analysis  of  crosspoint  patterns.  At  first 
thought,  the  idea  of  designing  a  system  for  simply  in¬ 
terconnecting  terminals  might  seem  too  basic  to  con¬ 
stitute  a  research  area.  However,  the  vast  majority  of 
the  area  of  circuit  switching  networks  lies  far  beneath 
the  surface. 


This  paper  presents  some  of  the  important  results 
of  recent  research  into  circuit  switching  networks 
and  their  complexity.  Several  different  types  of  cir¬ 
cuit  switching  networks  will  be  discussed,  together 
with  the  theoretical  complexity  and  the  best  known 
explicit  constructions  needed  to  implement  them.  As 
with  any  sampler,  the  network  designs  we  display 
will  hardly  exhaust  the  available  possibilities.  Nor  do 
we  mean  to  imply  that  those  omitted  are  held  in  lower 
regard  than  those  we  discuss.  Our  intent  is  only  to 
show  that  the  area  of  circuit  switching  networks  is 
embroidered  with  many  fascinating  designs  which 
are  now,  more  than  ever  before,  applicable  to  com¬ 
puter  system  architectures. 

Much  of  the  early  research  on  circuit  switching  net¬ 
works  was  motivated  by  the  needs  of  the  communica¬ 
tions  industry.  Indeed,  a  great  deal  of  the  work  in  this 
area  still  is  pursued  because  of  that  application. 
However,  with  the  advent  of  LSI/VLSI  technologies, 
a  circuit  switching  network  now  often  represents  a 
principal  subsystem  in  a  large  class  of  memory/ 
processor/peripheral  computer  complexes.  In  fact,  in 
many  such  systems  the  associated  circuit  switching 
network  significantly,  and  even  dominantly,  affects 
the  overall  cost  and  performance  of  the  system. 


As  generic  examples  of  their  uses  in  these  applica¬ 
tions,  three  schematics  of  computer  systems  contain¬ 
ing  circuit  switching  networks  are  shown  in  Figure  1 . 
In  Figure  1(a),  the  circuit  switching  network  is  re¬ 
quired  to  provide  paths  between  a  large  set  of  input 
devices  to  some  smaller  set  of  output  ports.  These 
output  ports  then  provide  access  for  those  input 
devices  to  some  other  devices  or  functions.  It  is  usual¬ 
ly  not  cost-efficient  or  necessary  for  general  opera¬ 
tion  to  have  one  such  device  or  function  for  each  input 
device.  Hence,  subsets  of  the  input  devices  are 
selected  by  the  circuit  switching  network  and,  in  ef¬ 
fect,  concentrated  to  the  output  side  of  the  network 
as  required  by  system  operation.  In  some  cases,  the 
output  functions  can  all  be  identical  so  that  the  order 
in  which  the  input  devices  are  interconnected  to  the 
output  ports  becomes  unimportant.  Such  an  inter¬ 
connection  requirement,  for  example,  can  be  found  in 
the  area  of  image  processing  where  the  input  devices 
are  memory  cells  and  the  output  functions  are  cor¬ 
relations,  or  where  the  input  devices  are  smart  sen¬ 
sors  and  the  output  functions  are  signal  processors. 

In  Figure  1(b),  the  circuit  switching  network  is  re¬ 
quired  to  provide  paths  between  specified  devices  at 
the  input  side  to  specified  devices  or  sets  of  devices  at 
the  output  side.  Some  or  all  of  the  devices  at  the  input 
and  output  sides  might  actually  be  identical,  but  the 
general  operation  can  involve  them  in  such  a  way  that 
they  must  be  treated  as  though  they  were  dis¬ 
tinguishable.  Hence,  the  input  devices  are  connected 
in  a  one-to-one  manner  or  expanded  in  a  one-to-many 
manner  by  the  network  to  the  output  devices  as  re¬ 
quired  by  system  operation.  Such  an  interconnection 
requirement  can  be  found  in  numerous  parallel  pro¬ 
cessing  environments.  For  example,  if  the  input 
devices  are  processors  and  the  output  devices  are 
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Figure  1.  Generic  examples  of  computer  systems  containing  circuit  switching  networks:  (a)  concentration,  (b)  connec¬ 
tion/expansion,  and  (c)  partitioning. 


memory  units,  then  we  have  what  is  often  referred  to 
as  a  data  alignment/data  access  interconnection  re¬ 
quirement. 

Finally,  in  Figure  1(c),  the  circuit  switching  net¬ 
work  must  partition  the  set  of  input  devices  into  dis¬ 
joint  subsets  so  that  the  subsets  can  function  as  in¬ 
dependent  subsystems.  Here,  many  different  parti¬ 
tioning  requirements  are  possible  depending  on 
whether  or  not  the  devices  are  distinguishable  and 
whether  or  not  the  specific  output  ports  to  which  they 
are  interconnected  is  important. 


Terminology 

A  circuit  switching  network  can  be  described  as  a 
set  of  contact  switches,  generically  called  cross- 
points,  joined  together  by  links  in  order  to  intercon¬ 
nect  network  input  terminals  (inputs)  to  network  out¬ 
put  terminals  (outputs).  Circuit  switching  networks 
of  interest  must  be  capable  of  doing  this  for  a  very 
large  number  of  interconnection  requirements.  An 
example  of  such  a  network  is  explicitly  shown  in 
Figure  2(a)  for  AT-inputs  and  Af -outputs.  This  network 
is  known  as  an  (NXMhcomplete  crossbar  network,  or, 
more  simply,  as  an  f/VXAO-network.  Clearly,  the 
number  of  crosspoints  is  NM,  since  there  is  one  cross- 
point  between  each  of  the  inputs  and  each  of  the  out¬ 


puts.  As  we  will  see,  many  well-known  circuit  switch¬ 
ing  network  constructions  consist  of  interconnected 
complete  crossbar  networks.1-4 

However,  not  all  crossbar  switches  of  interest  are 
complete.  A  sparse  crossbar  switch  has  some  inputs 
and  outputs  with  no  crosspoint  between  them  and, 
consequently,  between  which  no  direct  interconnec- 
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tion  is  possible.  An  example  of  this  type  of  construc¬ 
tion  is  the  binomial  network*  shown  in  Figure  2(b). 

Besides  the  illustrations  of  Figures  2(a)  and  2(b), 
which  explicitly  show  inputs,  outputs,  and  cross- 
points,  another  means  of  describing  crossbar  net¬ 
works  is  to  model  them  as  bipartite  graphs.  In  addi¬ 
tion,  the  adjacency  matrix  of  a  bipartite  graph  can  be 
used  to  represent  crossbar  networks  where  the  ones 
represent  crosspoint  placements.  This  is  illustrated 
in  Figure  3.  In  Figure  3(a),  a  (4  X3)-network  is  shown 
explicitly;  Figure  3(b)  is  a  node-edge  representation  of 
the  network,  with  the  12  edges  representing  the  12 
crosspoints;  and  in  Figure  3(c)  we  have  the  corre¬ 
sponding  graph  adjacency  matrix.  If  it  is  necessary 
to  indicate  the  direction  of  signal  flow  in  the  graph, 
the  edges  of  the  graph  representing  the  network  can 
be  directed  from  inputs  to  outputs.  Henceforth,  we 
will  illustrate  our  circuit  switching  networks  with 
whichever  representation  is  most  convenient  to  the 
immediate  discussion. 

Circuit  switching  networks  are  usually  classified 
according'to  the  types  of  interconnections  which  they 
must  provide.  In  the  following  sections  we  will 
discuss  some  aspects  of  the  circuit  switching  net¬ 
works  known  as  concentrators,  connectors,  ex¬ 
panders,  partitioners,  SIMD  interconnectors,  and 
sorters.  Definitions  of  the  input  to  output  intercon¬ 
nection  requirements  which  each  of  these  networks 
satisfy  are  treated  in  their  respective  sections.  For 
each  such  type  of  circuit  switching  network, 
however,  two  important  aspects  relating  to  their  in¬ 
terconnecting  capabilities  should  be  pointed  out 
here.  Note  that  the  circuit  switching  network  of 
Figure  3  can  interconnect  all  possible  combinations 
of  inputs  to  outputs;  and  it  can  provide  any  new  addi¬ 
tional  interconnection  between  an  unused  (or  “idle”) 
input  to  a  not  necessarily  idle  output  regardless  of  the 
other  interconnections  which  are  currently  in  pro¬ 
gress.  A  network  with  this  property  is  said  to  be  non- 
blocking.  In  other  words,  a  nonblocking  network  can 
satisfy  any  new,  valid  interconnection  request  in  the 
sense  of  providing  a  path  for  that  request  without 
disturbing  any  of  the  other  existing  interconnections 
in  the  network.  This  idea  can  be  made  somewhat  more 
formal  by  noting  that  a  specification  of  the  input-to- 
output  interconnection  requirements  at  any  time  in  a 
network  is  often  referred  to  as  an  interconnection 
assignment,  and  it  is  sometimes  described  as  a  map¬ 


ping  of  the  inputs  onto  the  outputs.  A  total  assign¬ 
ment  includes  all  the  inpuls/outputs.  A  detailing  of 
the  opened  and  closed  crosspoints  at  any  time  in  a 
network  is  referred  to  as  the  state  of  the  network. 
Then,  a  nonblocking  network  in  any  state  satisfying 
some  valid  interconnection  assignment  can  satisfy 
any  new,  valid  interconnection  assignment  contain¬ 
ing  the  previous  assignment  by  being  placed  in  a  new 
state  containing  the  old  state.  It  does  this  by  closing 
crosspoints  to  provide  the  additional  interconnecting 
paths  without  any  disturbance  to  existing  paths. 

As  opposed  to  the  above,  suppose  that  the  network 
interconnection  requirement  is  to  provide  disjoint  in¬ 
terconnections  from  any  specified  set  of  inputs  to 
some  specified  set  of  the  outputs.  Suppose  further 
that,  for  some  assignment  where  interconnections 
are  in  progress,  an  additional  request  can  sometimes 
only  be  realized  by  rearranging  some  of  the  existing 
interconnecting  paths.  A  network  which  provides  in¬ 
terconnections  in  this  manner  is  said  to  be  a  rear- 
rangeable  network.  Hence,  placing  a  rearrangeable 
network  in  a  new  state  to  satisfy  an  interconnection 
assignment  containing  the  presently  satisfied  inter¬ 
connection  requirement  sometimes  requires  the 
disturbance  of  existing  paths.  In  such  networks,  the 
upper  bound  on  the  maximum  number  of  rear¬ 
rangements  necessary  to  provide  for  any  additional 
interconnection  is  usually  significant. 

Two  of  the  main  circuit  switching  network  design 
techniques  are  decomposition  and  staging.  The 
former  consists  of  decomposing  the  initial  intercon¬ 
nection  requirement  into  a  set  of  subrequirements, 
each  of  which  is  similar  in  nature  to  the  initial  require¬ 
ment  but  usually  smaller  in  size  and,  therefore,  less 
complex  to  realize.  This  decomposition  is  usually  con¬ 
tinued  on  the  subrequirements  to  produce  even  finer 
subrequirements.  This  is  then  recursively  continued 
until  it  is  feasible  to  satisfy  the  finest  subrequirement 
with  a  brute-force  or  obvious  design.  Proceeding 
backwards  relative  to  the  decomposition  technique 
towards  the  initial  interconnection  requirement  ef¬ 
fectively  specifies  a  structure  which  satisfies  the 
original  requirement.  The  overall  structure  is  then 
referred  to  as  a  multi-stage  structure.  Each  stage  will 
often  consist  of  a  set  of  subnetworks  linked  to  the 
subnetworks  of  other  stages.  For  example,  an  output 
terminal  of  a  subnetwork  of  one  stage  might  be  linked 
to  an  input  terminal  of  a  subnetwork  of  another  stage. 
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Figure  3.  Representations  of  a  complete  crossbar  switch:  (a)  crossbar,  (b)  bipartite  graph,  G,  (c)  adjacency  matrix  of  G. 
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In  the  following,  in  illustrations  of  multi-stage  net-  greater  than  M,  the  number  of  outputs,  are  of  in¬ 
works  showing  explicit  crosspoint  designs,  the  links  terest.  We  will  let  the  capacity  of  a  concentrator  be 

will  be  shown  as  solid  lines  connecting  the  terminals  denoted  by  R,  where  R  is  the  maximum  number  of  in- 

(input  and  output)  of  one  stage  with  the  terminals  of  terconnections  that  can  be  made  simultaneously 

other  stages.  Such  links  can  simply  be  viewed  as  through  the  network.  In  other  words,  an  Ninput  to M 

hardwired  interconnectors.  In  the  graph  models  of  in-  output  concentrator  of  capacity  R  is  a  circuit  switch- 

terconnection  networks,  the  links  will  either  be  ing  network  capable  of  interconnecting  any  of  the  (£) 

shown  as  dotted  lines  (so  as  not  to  confuse  them  with  choices  of  inputs,  K  <  R,  to  some  K  of  the  outputs, 

edges  which  correspond  to  crosspoints),  or,  when  ap-  The  important  point  is  that  the  inputs  can  be  a  priori 

propriate,  the  output  nodes  of  one  stage  will  be  specified  but  the  outputs  to  which  they  will  be  con- 
superimposed  on  the  input  nodes  of  the  stage  to  nected  cannot  be  so  specified.  Hence,  a  concentrator 

which  they  would  be  (inked  in  an  explicit  construe-  can  be  functionally  described  by  a  triplet  of  integers 

tion.  When  discussing  graph  models,  we  will  often  W,  M,  R,),N>  M  >  R,  where 

refer  to  the  input/output  nodes  of  a  model  as  simply 

the  inputs/outputs  of  the  network,  and  the  input/out-  "  ~~  number  of  inputs, 

put  nodes  of  a  stage  as  simply  the  inputs/outputs  of  „  ~  num  . r  °‘  outPu  s> 

the  stage.  R  =  capacity. 

Implicit  in  the  concept  of  a  multi-stage  design  of  a  Figure  3  is  a  (4  X3)-network  which  is  functionally  a 
circuit  switching  network  is  a  tradeoff  which  is  at  the  (4,3,3)-nonblocking  concentrator.  Indeed,  it  is  clear 
center  of  much  of  the  research  in  the  area.  Namely,  we  that  an  (jVXA/)-network  is  an  (N, MM  (-nonblocking 
will  be  motivated  to  consider  multi-stage  designs  for  concentrator.  However,  concentrators  with  fewer 
various  types  of  interconhection  requirements  than  NM  crosspoints  are  known.  The  second  stage  of 
because  such  designs  will  often  be  shown  to  employ  Figure  5  is  a  (6,4,4)-rearrangeable  concentrator  in 
far  fewer  crosspoints  than  straightforward,  one-  which  the  pattern  of  crosspoints  connecting  the  out- 
stage  alternatives.  However,  the  overall  reduction  in  puts  to  the  inputs  consists  of  all  the  possible  (J) 
crosspoints  will  be  paid  for  by  the  added  complexity  choices  of  two  crosspoints  between  the  inputs  and  the 
of  realizing  assignments  in  the  network  in  the  sense  four  outputs.  This  configuration,  called  a  binomial 
of  computing  states  to  satisfy  requests,  and  in  the  concentrator,  can  be  shown  to  have  a  capacity  of  4.5 
delays  of  the  network  in  the  sense  that  a  signal  flow-  Concentrators  were  first  defined  by  Pinsker,6  who 
ing  from  the  network  input  to  the  network  output  will  proved  that  there  exist  (N,  Af,  Af)-rearrangeable  con- 

be  required  to  pass  through  more  than  one  cross-  centrators  for  all  N  >  M  with  at  most  29N  cross- 

point.  This  size  versus  control  conflict  will  be  seen  points.  One  particularly  interesting  aspect  of  Pin- 
repeatedly  in  the  following  sections.  sker's  network,  as  shown  in  Figure  4,  is  his  recursive 

Accordingly,  the  role  of  complexity  theory  in  pro-  use  of  a  network  structure  in  order  to  obtain  this 
viding  bounds  on  the  maximum  and  minimum  bound.  As  discussed  earlier,  it  is  common  to  exploit  a 
number  of  crosspoints  required  in  order  to  realize  a  structural  concept  recursively,  starting  with  the 
given  interconnection  requirement  is  significant  in  complete  problem  and  using  some  insightful  observa- 
the  area  of  circuit  switching  networks.  Lower  bounds  tion  to  form  subproblems  which  can  be  further 
provide  a  theoretical  minimum,  below  which  it  is  decomposed.  The  key,  of  course,  is  to  determine  the 
futile  to  attempt  to  construct  practical  networks;  and  structural  concept  which  provides  some  reduction  in, 
upper  bounds  provide  a  convenient  benchmark  with  for  this  case,  the  number  of  crosspoints  (or  edges  in 
which  to  measure  proposed  designs.  In  the  following,  the  graph  model)  relative  to  a  benchmark  and, 
for  each  type  of  circuit  switching  network,  we  will  perhaps,  even  agrees  asymptotically  with  the  infor- 
give  the  known  theoretical  and  constructional  mation  theoretical  optimum 

bounds.  For  those  bounds,  all  logarithms  will  be  to  Thus,  an  upper  bound  on  the  necessary  asymptotic 
the  base  2  unless  otherwise  specified.  It  is  interesting  growth  of  the  number  of  crosspoints  in  an 
to  point  out  here  that  along  these  lines  we  will  see  ( N, M, M)-rearrangeable  concentrator  as  the  number 
cases  in  the  following  sections  where  a  decomposition  of  inputs,  N,  grows  is  on  the  order  of  N  crosspoints, 
of  a  class  of  assignments  can  be  described,  and  the  ex-  that  is,  OiN  crosspoints.  Relative  to  the  information 
istence  of  a  subnetwork  which  satisfies  the  assign-  theoretical  bound,  this  is  the  best  order  of  growth 
ments  with  a  given  bound  on  its  number  of  cross-  possible.  However,  it  must  be  pointed  out  here  that 
points  can  be  proven,  but  where  no  explicit  practical  Pinsker's  results  are  non-constructive  in  that  they 
construction  technique  for  designing  these  networks  aj-e  not  generally  based  on  explicit  constructions, 
is  known.  Rather,  they  are  based  on  a  proof  of  only  the  existence 

of  an  (N,MR(-rearrangeable  concentrator  for  all  R  < 
M.  It  is  significant  that  there  does  not  exist  a  method 
Concentrators  at  present  (other  than  an  exhaustive  consideration  of 

all  possibilities)  which  leads  to  their  explicit  construc- 

A  concentration  circuit  switching  network  or,  more  tion  for  all  N. 
simply,  a  concentrator  interconnects  specific  idle  in-  There  have,  however,  been  some  other  recent  con- 
puts  to  arbitrary  idle  outputs  by  providing  disjoint  tributions  regarding  constructions.  Margulis,*  on  the 
circuits  from  the  inputs  to  the  outputs*"7  Clearly,  only  basis  of  a  complex  argument  involving  the  theory  of 
concentrators  where  N,  the  number  of  inputs,  is  group  representations,  has  given  a  construction 
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Figure  4.  Pinsker’s  concentration  network. 


technique  which  results  in  a  concentrator  of  predeter¬ 
mined  capacity  with  K  stages,  wherein  each  stage  re¬ 
quires  at  most  5 IV  crosspoints.  However,  we  are  not 
aware  of  any  results  that  firmly  establish  a  bound  for 
K.  This  unfortunately  jeopardizes  the  applicability  of 
Margulis'  results. 

A  known  constructive  result  is  Masson’s  binomial 
concentrator4  (see  Figure  5),  which  shows  that  the 
(^-network  is  a  (15,6,4)-rearrangeable  concentrator 
and  that  the  (J)-network  is  a  (6,4,4)-rearrangeable 
concentrator.  Hence  the  cascade  of  Figure  5  is  a 
(15,4,41-rearrangeable  concentrator  with  only  42 
crosspoints.  This  can  be  favorably  compared  with  a 
benchmark  of  a  complete  bipartite  graph  with  15  in¬ 
puts  and  four  outputs  representing  a  concentrator 
which  has  60  crosspoints.  A  practical  aspect  of  the 
use  of  the  binomial  construction  is  that  an  upper 
bound  of  one  plus  the  number  of  stages  can  be  shown 
to  exist  for  the  number  of  existing  paths  which  must 
be  rearranged  to  satisfy  a  new  request  in  its  opera¬ 
tion. 

In  general,  the  graph  model  of  an  (y)-binomial 
circuit  switching  network  or,  more  simply,  an 
(y)-network,  has  (y)-inputs,  A'-outputs,  Y-edgesI 
input,  y(y)/X  edges/output,  and  a  capacity  of  Y+2. 
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Figure  5.  Masson's  binomial  concentrator. 


Thus  it  can  functionally  be  described  as  an 
y+2)-rearrangeable  concentrator. 

Another  construction  of  a  concentrator  is  an  in¬ 
teresting  generalization  of  the  basic  binomial  net¬ 
work.  Recall  that  the  adjacency  matrix  of  the  graph 
representing  a  network  is  composed  of  zeros  and 
ones,  where  each  of  the  non-zero  entries  denote  the 
position  of  edges  between  the  vertices  of  the  graph 
corresponding  to  the  inputs  and  outputs  of  the  net¬ 
work.  Suppose  now  that  instead  of  each  vertex  of  this 
graph  representing  only  a  single  input  or  output  ter¬ 
minal,  it  represented  a  set  of  input  or  output  ter¬ 
minals,  and  in  the  same  sense,  suppose  t  hat  each  edge 
in  this  graph  model  represented  a  binomial  network 
providing  interconnecting  paths  between  those  input 
terminals  and  output  terminals.  The  capacity  of  the 
resulting  composite  network  would  clearly  be  a  func¬ 
tion  of  the  base  network  and  the  particular  replace¬ 
ment  networks  used  for  the  replacement  of  each  non¬ 
zero  in  the  graph  adjacency  matrix.  For  example,  if 
the  graph  of  a  (j  )-network  is  used  not  only  as  the  base 
graph,  but  also  as  the  replacement,  the  result  is  a 
(36,16,ll)-rearrangeable  concentrator  with  only  144 
edges.9  Note  that  a  36-input,  16-output  network  with 
1 1  crosspoints  between  each  input  and  the  outputs 
would  require  396  crosspoints. 

Such  a  network  corresponds  to  a  super  binomial 
network.  More  explicitly,  it  can  be  described  as  a 
(2)  -network.  Ingeneral, any  (^y2)2  graph,  where  Y> 
2,  has  capacity  of  R  =  (Y+l)  (y+2)— 1,  and  thus  it 
represents  a  ((y£2)  2,  (Y+2F,  (Y+1)  (y+2)-l)-rear- 
rangeable  concentrator.9  Such  concentrators  are  at¬ 
tractive  because  they  are  sparse,  and  they  are  model¬ 
ed  as  a  single,  incomplete  bipartite  graph.  The  more 
general  class  of  (y)"  networks  can  be  designed  in  an 
analogous  manner. 

Regarding  nonblocking  concentrators,  Pippenger7 
has  derived  the  (somewhat  surprising)  result  that  the 
number  of  crosspoints  of  such  networks  is  CXMogN). 
As  we  will  see,  this  is  asymptotically  the  same  as  that 
required  for  nonblocking  connectors,  although  the 
former  intuitively  seems  as  though  it  should  be  much 
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less  difficult  to  implement  than  the  latter.  Little  is 
known  about  minimal  explicit  constructions  for 
nonblocking  concentrators. 

Af-superconcentrators10'11  are  a  type  of  rear- 
rangeable  circuit  switching  network  with  more 
powerful  interconnecting  capabilities  than  a  concen¬ 
trator.  In  the  graph  model  of  an  A^-superconcentrator 
with  AT  inputs  and  AT  outputs  for  every  set  of  R  <  N  in¬ 
puts  and  every  set  of  R<  N  outputs,  there  exists  a  set 
of  R  vertex  disjoint  paths  from  the  set  of  inputs  to  the 
set  of  outputs.  Thus,  in  AAsuperconcentrators  the 
sets  of  input  and  output  terminals  to  be  intercon¬ 
nected  can  both  be  specified,  but  the  individual  input- 
to-output  interconnections  within  the  sets  cannot  be 
specified.  Clearly,  superconcentrators  are  signifi¬ 
cantly  more  powerful  than  regular  concentrators,  yet 
Pippenger11  has  proven  the  existence  of  Af-super- 
concentrators  with  at  most  40N  crosspoints.  Again, 
however,  no  explicit  construction  which  generally 
satisfies  the  bound  is  known.  Pippenger’s  proof  is 
based  on  the  existence  of  (6m,4m,3m)-rearrangeable 
concentrators  with  only  36m  crosspoints.  These  con¬ 
centrators  are  used  in  the  recursive  construction  il¬ 
lustrated  in  Figure  6. 

It  is  worth  pointing  out  here  that  Pippenger 
establishes  his  results  on  superconcentrators  with  a 
probabilistic  approach.  The  principal  idea  behind 
such  an  approach  is  to  show  that  the  probability  that 
a  network  exists  is  greater  than  zero,  or  conversely, 
the  probability  that  it  does  not  exist  is  less  than  one. 
Proving  either  conjecture  implies  that  the  network 
exists,  although  it  does  not  explicitly  lead  to  a  con¬ 
struction.  Besides  Pippengers’s  paper,"  a  more 


thorough  treatise  of  this  type  of  probabilistic  ap¬ 
proach  is  provided  by  Erdos  and  Spencer.12 

Another  type  of  concentrator  which  is  less  power¬ 
ful  than  an  AT-superconcentrator  is  an  AMiyper- 
concentrator,  which  is  a  rearrangeable  circuit  switch¬ 
ing  network  with  N  inputs  and  N  outputs  in  which 
any  specified  set  of  k<N inputs  can  be  interconnected 
in  some  arbitrary  order  to  the  first  k  outputs.  Clearly, 
an  A-hyperconcentrator  has  less  interconnecting 
capability  than  that  of  an  AT-superconcentrator. 
Thompson13  has  shown  that  a  network  called  an 
(Af,  Af)-infrageneralizer  which  is  due  to  Ofman14  can  be 
reversed  (in  the  sense  that  inputs  of  the  (AT.AO-infra- 
generalizer  are  considered  as  the  outputs  of  the 
AAhyperconcentrator  and  the  outputs  of  the 
(fV,W)-infrageneralizer  are  considered  as  the  inputs  of 
the  N-hyperconcentrator)  to  produce  an  N-hyper- 
concentrator.  An  8-hyperconcentrator  is  illustrated 
in  Figure  7.  It  is  easily  shown  that  such  A^hyper- 
concentrators  have  2N\ogN  crosspoints.13  This  par¬ 
ticular  construction  of  an  Af-hyperconcentrator  has 
the  enhanced  capability  of  interconnecting  the 
selected  k<N inputs  to  any  k<N consecutive  outputs 
(allowing  for  wraparound)  while  maintaining  the 
order  of  the  inputs. 


Connectors 

In  a  connection  circuit  switching  network,  specific, 
idle  inputs  request  interconnecting  paths  to  specific, 
idle  outputs.  Because  of  the  affinity  of  such  networks 
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with  telephony,  much  has  been  published  on  this  sub¬ 
ject.  Moreover,  if  N  is  the  number  of  inputs  to  a  con¬ 
nector  and  M  is  the  number  of  outputs,  then  when 
N=M,  a  connection  assignment  of  all  inputs  onto 
all  IV  outputs  can  be  represented  by  a  bijective  map¬ 
ping  of  the  inputs  onto  the  outputs;  in  other  words, 
the  assignment  is  a  permutation  of  the  inputs.  This 
observation  permits  the  application  of  a  broad  range 
of  mathematical  theory  to  the  design  of  connectors. 
Since  results  for  other  situations  follow  from  the 
results  for  the  case  when  N  ■=  M,  we  will  discuss  this 
case  only  in  the  following. 

It  is  clear  that  since  a  complete  crossbar  network 
has  one  crosspoint  between  every  input  and  output, 
such  a  network  will  operate  as  a  nonblocking  connec¬ 
tor.  Hence,  if  we  refer  to  an  N  input,  N  output,  connec¬ 
tion  network  of  capacity  N  as  an  (N,N,N)-irear- 
rangeable  or  nonblocking)  connector,  an  upper  bound 
on  the  number  of  crosspoints  of  such  a  connector  is 
IV2.  This  then  is  a  benchmark  with  which  to  compare 
our  subsequent  reported  results. 

To  get  an  information  theoretical  lower  bound  on 
the  order  of  growth  of  the  number  of  crosspoints,  we 
can  use  an  argument  given  by  Shannon. 15  Since  there 
are  effectively  TV!  assignments  which  must  be  real¬ 
ized  by  an  (fV,fV,lV>-connector,  and  since  each  cross- 
point  in  such  a  network  effectively  has  two  states, 
opened  or  closed,  2  raised  to  the  power  the  number  of 
crosspoints  must  be  greater  than  or  equal  to  Nl  F rom 
Stirling's  well-known  formula  for  the  approximation 
of  a  factorial,16  it  follows  that  the  asymptotic  growth 
of  the  number  of  crosspoints  is  CHNlogN).  We  will 
now  compare  some  designs  with  this  information 
theoretical  lower  bound  on  the  number  of  cross- 
points. 

We  will  first  consider  rearrangeable  connection 
networks.  Benes’  classic  book1  describes  a  construc¬ 
tion  which  is  based  on  Hall's  theorem11  (sometimes 
called  “the  marriage  theorem”).  For  a  three-stage 
network,  the  approach  uses  Hall's  theorem  to  decom- 
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pose  the  original  assignment  describing  the  mapping 
of  the  N  inputs  onto  the  N  outputs  into  n 
subassignments.  Repeating  this  decomposition  on 
the  subassignments  leads  to  a  multistage  connector. 
The  structure  of  such  a  network  is  explicitly  shown  in 
Figure  8  in  a  three- stage  form.  In  this  form,  the  struc¬ 
ture  consists  of  the  following;  (1)  a  first  stage  (also 
called  an  input  stage)  of  r,  (n,Xm)-networks,  where 
the  n^j  =N  terminals  on  the  left  (input)  side  of  this 
stage  are  the  inputs  to  the  overall  network;  (2)  a  last 
stage  (also  called  an  output  stage)  of  r2  (mXn2)- 
networks  where  the  n2r2=lV  terminals  on  the  right 
(output)  side  of  this  stage  are  the  outputs  to  the 
overall  network;  (3)  a  middle  stage  of  m 
(r,Xr2)-networks  where  the  terminals  on  the  input 
side  of  thi  s  middle  stage  are  linked  to  the  terminals  on 
the  output  side  of  the  input  stage  so  that  each  of  the 
m  (r,  Xr2l- net  works  in  this  middle  stage  has  exactly 
one  link  to  each  of  the  r,  (n ,  Xm)-net  works  in  the  input 
stage,  and  where  the  terminals  on  the  output  side  of 
this  middle  stage  are  linked  to  the  terminals  on  the  in¬ 
put  side  of  the  output  stage  so  that  each  of  the  m 
(r,  Xr2)-net works  in  this  middle  stage  has  exactly  one 
link  to  each  of  the  r2  (m  X  n2)-net works  in  the  output 
stage.  It  can  be  shown  for  this  network1  that  if 
m^maxln,,  n2|,  then  the  result  is  an  (TV, AT, IV)- 
connector.  Note  that  if  ni=rl=n2—r2=I\P'2,  then  the 
number  of  crosspoints  in  the  network  is  31V3'2  — 31V. 
Moreover,  it  can  be  shown16  that  the  maximum 
number  of  interconnecting  paths  through  the  net¬ 
work  which  must  be  rearranged  to  satisfy  a  new  inter¬ 
connection  request  from  an  idle  input  to  an  idle  out¬ 
put  is,  in  general,  for  such  three-state  structures 
|max(/-„r2)-l). 

Beizer,19  Joel,20  Waksman,21  and  Opferman  and 
Tsao-Wu22  give  refinements  of  this  approach  with  a 
decomposition  which  employs  the  repeated  use  of 
(2X2)-networks  as  the  basic  element  of  the  network. 
The  results  are  constructions  in  which  the  number  of 
crosspoints  has  O(MoglV)  growth  with  N,  which  by 
our  above  discussion  is  the  best  order  of  growth 
possible.  For  example,  in  the  (IV,  N,  NY Waksman  con¬ 
nector,  shown  for  the  1V= 8  case  in  Figure  9,  there  are 
4NlogN—4N+4  crosspoints  in  general.  Benes1  has 
also  shown  that  if  the  decomposition  utilizes  a 
(3X3)-network,  then  the  number  of  crosspoints  of 
3.8MoglV  crosspoints  grows.  It  is  worth  noting  that 
an  (1V,1V, lV>Waksman  network  has  fewer  crosspoints 
than  the  (/VX/Vj-network  for  all  ^>16. 

It  should  be  clear  that  for  these  networks  the 
number  of  stages,  and,  therefore,  the  number  of  cross- 
points  involved  in  any  input-output  path,  often  re¬ 
ferred  to  as  the  delay,  will  be  OOoglV).  For  example, 
the  {N,N,N)- Waksman  connector  has  a  2logN—l 
delay  in  this  sense.  This  means,  of  course,  that  a 
realizing  state  to  satisfy  a  request  can  be  quite  com¬ 
plex.  Effectively,  the  use  of  a  minimal  crosspoint 
design  implies  that  a  tradeoff  has  been  accepted 
resulting  in  fewer  crosspoints  at  the  cost  of  greater 
control  complexity  and  delay.  Regarding  this  latter 
point  somewhat  more  specifically,  given  a  request  to 
be  satisfied  by  one  of  the  above  O(MoglV)  connection 
networks,  Waksman21  and  Tsao-Wu  and  Opferman22 
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Figure  8.  A  three-state  interconnection  network  structure. 


give  algorithms  which  require  O(MogAT)  operations. 
Recently,  Lenfant23  has  developed  some  control 
algorithms  which  take  advantage  of  an  apriori 
knowledge  of  “frequently  used”  requests  to  alleviate 
this  control  problem  somewhat.  However,  for  some 
real-time  applications,  set-up  times  might  still  be  pro¬ 
hibitive.  Indeed,  delay  and  set-up  time  perhaps  repre¬ 
sent  the  most  serious  limitation  of  O(MogJV)  designs 
of  {N,N,  AO-rearrangeable  connectors. 

There  are  applications  where  not  all  mappings  of 
the  inputs  to  the  outputs  are  of  interest.  Lawrie24  has 
proposed  what  is  referred  to  as  an  omega  network  for 
interconnections  required  in  the  access  and  align¬ 
ment  of  data  in  an  array  processor.  The  omega  net¬ 
work  structure  is  effectively  the  left-hand  half  (in¬ 
cluding  the  middle  stage)  of  the  Waksman  structure 
where  the  linking  between  the  stages  is  usually  in  a 
perfect-shuffle25  form.  Clearly,  not  all  permutations 
of  the  inputs  to  the  outputs  can  be  realized.  However, 
Lawrie  shows  that  the  data  elements  of  an  (nXn| 
matrix  can  be  stored  using  a  skewing  scheme  in  2 n  in¬ 
dependent  memory  units  (each  data  element  being 
placed  in  only  one  such  unit)  so  that,  in  a  conflict-free 
manner,  rows,  columns,  diagonals,  backward  diagon¬ 
als,  and  square  blocks  in  row-major  or  column-major 
order  can  be  accessed  and  changed  by  an  omega  net¬ 
work. 

We  next  consider  nonblocking  connection  net¬ 
works.  The  first  published  work  regarding  the  con¬ 
struction  of  such  networks  is  due  to  Clos.2®  By  means 


of  a  straightforward  argument,  he  showed  that  for 
three  stages  of  the  structure  of  Figure  8.  if  m  > 
/ii  +n2— 1,  the  network  has  nonblocking  cap  ability. 
Note  that  if  n1=r,=n2=r2=Ar1'2  then  m=2A^'2—l, 
and  the  number  of  crosspoints  in  the  resulting  net- 
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work  is  6N  3,2  —3 N.  It  is  easily  seen  that  the  three- 
stage  nonblocking  Clos  connector  has  fewer  cross- 
points  than  an  WXNYnetwork  for  N> 36.  Further¬ 
more,  Cantor27  shows  that  with  the  Clos  argument 
applied  recursively,  a  structure  with  OIM logN)2269  ) 
results. 

Bassalygo  and  Pinsker23  have  shown  that,  in¬ 
deed,  constructions  for  (N.Af.M-nonblocking  connec¬ 
tors  do  exist  for  which  the  number  of  crosspoints  is 
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Figure  10.  (4,4,4)-Cantor  network. 
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Figure  1 1 .  Generalized  model  of  a  Cantor  network. 


OiNlogN).  However,  these  results  are  non-con- 
structive  in  the  sense  previously  described,  because 
their  proof  of  the  existence  of  such  (N.N.Nbnonblock- 
ing  connectors  is  based  on  a  proof  of  the  existence 
(but  not  the  construction)  of  an  N  input,  N  output 
sparse  network  where  each  input  has  crosspoints  be¬ 
tween  it  and  12  of  the  N  outputs  and  through  which 
any  choice  of  one-third  of  the  inputs  can  be  connected 
to  more  than  two-thirds  of  the  outputs.  Such  sparse 
networks  could  be  linked  together  (if  their  explicit 
constructions  were  known)  to  yield  an  (N,N,N)-non- 
blocking  connector  with  O  iNlogN)  crosspoints.  More 
precisely,  Pippenger2®  has  recently  shown  that  such 
networks  exist  with  90  Mog3Af  crosspoints. 

The  minimal  known  construction  of  an  (N,N,  NY- 
nonblocking  connector  has  been  provided  by 
Cantor.27-30  Since  this  construction  is  considerably 
different  from  the  other  connectors  discussed  so  far, 
it  would  be  appropriate  to  sketch  it  here.  Similar  to 
the  (Ar,Ar,Ar)-CIos  network.  Cantor's  network  has  a 
multiple  stage  structure  which  is  symmetric  around 
the  middle  stage.  Cantor's  approach  is  to  use  a  par¬ 
ticular  construction  technique  and  then  consider  the 
number  of  accessible,  idle  terminals  in  the  middle 
stage  of  the  resulting  network  relative  to  any  idle  in¬ 
put.  This  number  can  be  easily  calculated  in  a  recur¬ 
sive  manner  because  of  the  construction  technique 
used.  By  symmetry,  the  number  is  also  known 
relative  to  any  idle  output.  The  sum  of  the  two 
numbers  when  compared  with  the  total  number  of 
idle  terminals  in  the  middle  stage  at  any  time  in¬ 
dicates  that  there  is  a  common  idle  terminal  in  the 
middle  stage  through  which  the  idle  input  can  be  con¬ 
nected  to  the  idle  output. 

To  see  this  in  more  detail,  consider  the  graph  model 
of  a  (4,4,4)-Cantor  network  (shown  in  Figure  10). 
Recall  that  in  such  a  graph  model  the  dotted  lines  do 
not  represent  crosspoints  but  simply  associate  nodes 
which  correspond  to  terminals  which  would  be  linked 
together  in  a  construction.  Hence,  there  are  four 
(1  X2)-net  works  in  both  the  input  stage  and  the  out¬ 
put  stage;  and  there  are  four  (2X2)-networks  in  each 
of  the  three  intermediate  stages.  It  should  be  noted 
also  that  the  association  of  nodes  between  stages  in 
the  model  corresponds  to  the  so-called  perfect- 
shuffle25-31-32  in  a  progressing  manner.  We  will  now 
analyze  the  input  side  of  the  network,  and  by  sym¬ 
metry  our  observations  will  also  hold  for  the  output 
side.  On  the  input  side,  a  generalization  of  network 
modeled  by  Figure  1 1  would  indicate  that  each  of  the 
N  inputs  of  the  first  stage  is  in  a  (1  XlogAO-network. 
This  would  be  followed  by  log  N  stages,  each  stag* 
consisting  of  INI 2)  logjV(2  X  2)  networks,  where  the  last 
of  these  IogJV  stages  (actually  thellogJV  +1)°’  stage  in 
the  network)  is  the  middle  stage.  Again,  the  perfect 
shuffle  concept  is  used  to  progressively  associate  th) 
nodes  in  the  stages  in  the  model.  To  further  illustrate 
the  above.  Figure  12  shows  the  input  side  of  a( 
(8,8,81-Cantor  network. 

As  seen  from  Figure  12  any  input  is  connectable  II 
log/V  intermediate  terminals  which  are  linked  to  th 
inputs  of  the  loglV  (2X2)-networks  of  the  secoa 
stage.  Using  these  (2X2)-networks  in  the  secoq 
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stage,  we  then  see  that  each  of  the  inputs  of  the  first 
stage  is  connectable  to  2logN  output  terminals  of  the 
second  stage.  For  a  given  input,  we  will  call  the  nodes 
representing  these  output  terminals  of  the  (in  this 
case)  second  stage  the  accessible  nodes  of  that  input. 
The  crucial  point  here  is  that  because  of  the  construc¬ 
tion  of  the  network,  the  number  of  accessible  nodes  of 
an  input  increases  by  a  factor  of  two  each  time  we 
move  a  stage  closer  to  the  middle  stage.  Hence,  when 
the  middle  stage  is  reached,  the  number  of  accessible 
nodes  to  any  input  is  N\ogN. 

But  it  is  clear  that  connections  that  have  been 
previously  made  and  are  thereby  existing  in  the  net¬ 
work  at  a  given  time  will  limit  the  actual  accessibility 
of  any  input  to  terminals  of  the  middle  stage.  Hence, 
we  see  that  our  concern  must  be  with  idle,  accessible 
nodes.  Let  us  then  consider  the  worst  case  along 
these  lines  for  stage  i,  i—  1,.  .  .,  logN  +1.  We  will 
denote  the  number  of  idle,  accessible  nodes  of  stage  t 
to  any  input  asA(i).  By  the  construction  indicated  in 
Figure  11, 

A{1)  =  logN. 

Now,  the  number  of  accessible  nodes  on  the  output 
side  of  the  second  stage  is  twice  that  of  the  output 
side  of  the  first  stage,  but  note  that  one  of  these  out¬ 
put  nodes  can  be  busy  because  of  other  connections 
being  realized  in  the  network.  Thus, 

A(2)  =  2  A(l)  —  1. 

Similarly, 

Ai3)  =  2  Ai2)  -  2. 

In  general, 

A(i+1)  =  2  A(i)  —  2<~l. 

Since  the  initial  value  is  >1(1)  =  logN,  the  following 
closed  form  solutions  for  A(i)  can  be  obtained  for  i>2: 

Aii)  -  2*~i  logN  -  (*— 1)  2*~2. 

A 

Hence,  for  any  idle  input,  the  number  of  idle,  accessi¬ 
ble  output  nodes  of  the  middle  stage  is 

A(logN  +  1)  =  2loKjV(logjV)  -  logN  <2l°(S*-'). 

=  ];  NlogN. 

In  the  worst  case,  of  the  total  N\ogN  terminals  on  the 
output  side  of  the  middle  stage,  at  least  NlogN— 
(N — 1)  are  idle.  Of  these  idle  terminals,  at  least 
(1/2)  NlogN  can  be  accessed  by  any  idle  input.  Clearly, 
in  less  than  worst-case  conditions,  more  of  the  idle 
terminals  can  be  accessed.  By  symmetry  a  similar 
argument  can  be  made  for  any  idle  output  of  the  net¬ 
work  relative  to  the  terminals  on  the  input  side  of  the 
middle  stage.  Clearly,  this  implies  that  any  idle  input 
and  any  idle  output  can  be  interconnected  regardless 
of  whatever  other  interconnections  also  exist,  which 
is,  of  course,  the  desired  nonblocking  property. 


Pippenger33  has  developed  an  algorithm  for  finding 
an  interconnecting  path  through  a  Cantor  network 
which  requires  Oi logN)  operations. 

To  determine  the  number  of  crosspoints  in  an 
(N,N,N)-Cantor  network,  we  note  first  that  the  input 
stage  has  N  (1  XlogNVnetworks.  This  results  in 
NlogN  crosspoints.  Next,  we  note  that  the  next 
logN—  1  stages  (that  is,  all  the  intermediate  stages 
between— and  excluding— the  input  stage  and  the 
middle  stage)  each  have  2Nlog N  crosspoints.  The 
same  numbers  hold  for  the  output  side  of  the  middle 
stage  of  the  network.  Summing  the  number  of  cross- 
points  on  each  side  of  the  middle  stage  and  then  ad¬ 
ding  2NlogN  crosspoints  for  the  middle  stage  yields 

2  {NlogN  +  (logN-1)  (2  NlogN)  +  NlogN)  = 

4  N(logN)2  crosspoints. 

If  N  is  a  power  of  2,  then  an  |N,N,N)-Cantor  network 
has  less  crosspoints  than  an  (N,N,  AO-Clos  network  for 
all  N>2*.  A  still  further  reduction  can  be  made  to  pro¬ 
duce  Cantor  networks  with  0(N(logN— l)2)  cross- 
points  and  two  fewer  stages  of  delay.  Recently,  Pip- 
penger28  has  slightly  improved  this  bound  on  the 
number  of  crosspoints  in  a  Cantor  construction  to 
16N(logiN)2. 


Figure  12.  Input  side  of  an  (8,8,8)-Cantor  network. 
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Partitioned 


A  partitioner  is  a  circuit  switching  network  which 
performs  the  function  of  partitioning  a  set  of  devices 
usually  attached  to  the  inputs  of  the  network  into  dis¬ 
joint  subsets  so  that  the  devices  within  each  subset 
can  communicate  with  each  other  over  private  buses. 
Clearly,  the  most  straightforward  construction  of  a 
partitioner  is  an  (NX  (N/2)  )-net  work  where  the  de¬ 
vices  to  be  partitioned  are  attached  to  the  N  inputs. 
Notice  that  if  the  outputs  of  the  network  are  indis¬ 
tinguishable  in  their  use,  the  M2  horizontal  lines  of 
such  a  network  can  be  viewed  as  simply  M2  indis¬ 
tinguishable  buses  to  be  shared  by  the  subsets  of 
devices.  Then  the  function  of  the  (NX(M2)  l-network 
would  simply  be  to  interconnect  each  of  the  specified 
subsets  of  input  devices  to  a  distinct  bus  line,  where 
the  specification  of  the  input  subsets  could  either  be 
according  to  size  (if  the  input  devices  are  indis¬ 
tinguishable)  or  according  to  an  exact  listing  of  the  in¬ 
put  devices  (if  the  input  devices  are  distinguishable). 
Notice  also  that  if  the  outputs  were  attached  to 
distinguishable  output  devices  so  that  subsets  of  in¬ 
put  devices  had  to  be  connected  to  particular  buses  in 
the  system  operation,  the  (NX(M2))-network  could 
also  realize  this  type  of  partitioning  demand. 

Such  a  complete  crossbar  circuit  switch  realization 
of  a  partitioner,  however,  requires  0(M)  crosspoints. 
Hence,  in  spite  of  the  simplicity  of  control,  for  most 
systems  which  demand  partitioning  capabilities, 
alternative  designs  which  require  fewer  crosspoints 
must  be  considered.  Along  these  lines,  in  addition  to 


Table  1. 

Partitions  ol  live  identical  elements. 


Partitions  ol  Devices 

Number  ol  Subsystems 

( (D),  (D).  (D),  (D),  (D) ) 

5 

( (0,0),  (D),  (0),  (0) ) 

4 

( (0,0),  (0.0).  (D) ) 

3 

( (0,0.0,).  (0).  (0) ) 

3 

( (0,0.0),  (0,0) ) 

2 

( (D.D.D.D).  (0) ) 

2 

(0.0,0. 0.0) 

t 

the  other  issues  with  which  we  have  become  involved 
;n  our  consideration  of  circuit  switching  networks, 
two  further  issues  must  be  considered  for  parti- 
tioners:  we  must  establish  whether  or  not  the  devices 
to  be  partitioned  are  identical;  we  must  establish 
whether  or  not  the  outputs  of  the  partitioner  are  to  be 
specified. 

If  the  system  consists  of  identical  devices,  the  in¬ 
itial  formation  of  subsets  is  greatly  simplified 
because  our  concern  is  only  with  interconnecting 
unspecified  gToups  of  specified  sizes.  If  the  devices 
are  distinguishable,  for  example,  the  devices  might 
be  a  collection  of  processors  and  memories;  or  if  the 
devices  are  physically  identical  but  are  used  in  the 
processing  of  concurrent,  asynchronous  tasks  so  that 
their  states  make  them  functionally  distinguishable, 
then  the  partitioning  is  clearly  more  demanding. 

Similarly,  as  suggested  above,  the  outputs  must 
sometimes  be  specified  in  a  partitioning  request.  For 
example,  each  such  output  can  be  a  port  to  some 
specific  device  or  function,  or  it  could  be  a  signal  or 
data  source  required  by  one  of  the  subsets  of  devices 
in  the  partition.  Such  cases  are  clearly  more  com¬ 
plicated  to  realize  than  those  in  which  the  outputs  are 
indistinguishable. 

On  the  basis  of  the  above,  we  will  refer  in  the  follow¬ 
ing  to  (M  distinguishable  inputs,  M  distinguishable 
outputshpartitioners  as  (NDjWD)-partitioners;  (N 
distinguishable  inputs,  M  indistinguishable 
outputshpartitioners  as  (NDJk/D-partitioners;  (M  in- 
distinguishable  inputs,  M  distinguishable 
outpu t s)-parti tioners  as  [Nljrf D)-parti tioners;  and 
(AT  indistinguishable  inputs,  M  indistinguishable  out- 
puts)-partitioners  as  (N I .  AfD  parti tioners.  Further¬ 
more,  if  all  partitions  cannot  be  realized  by  a  parti¬ 
tioner,  we  will  indicate  this  by  referring  to  the  parti¬ 
tioner  as  incomplete.  Partitioning  can  also  be  per¬ 
formed  in  a  nonblocking  or  in  a  rearrangeable  man¬ 
ner.  Nonblocking  partitioning  involves  the  capability 
of  establishing  the  requested  interconnection  of  a 
subset  of  devices  without  disturbing  the  existing 
subset  interconnections  presently  implemented  by 
the  network.  Rearrangeability  implies  that  such 
disturbances  occur.  Indeed,  not  only  must  paths  be 
rearranged,  but  at  times  the  devices  composing  cur¬ 
rently  interconnected  subsets  must  also  be  rear¬ 
ranged.  Such  classification  of  networks  will  not  be 
stressed  in  the  following.  However,  in  some  of  the 
suggested  approaches  to  the  realization  of  partition¬ 
ing  demands,  the  use  of,  for  example,  connectors  will 
be  suggested;  it  will  be  apparent,  however,  that  even 
if,  say,  nonblocking  connectors  were  used  in  those 
situations,  nonblocking  partitioning  will  not  always 
result. 

The  simplest  partitioning  requirement  clearly  in¬ 
volves  indistinguishable  inputs  and  indistinguish¬ 
able  outputs.  In  fact,  for  such  a  requirement  and  N 
devices,  a  straightforward  serial  structure  withM—  1 
crosspoints  as  shown  explictly  in  Figure  1 3(a)  is  suffi¬ 
cient.  Note  that  we  take  advantage  of  the  fact  that 
the  outputs  are  indistinguishable  to  design  a  network 
with  effectively  no  outputs.  Still  simpler  structures 
are  possible.  For  example,  if  N=  5,  all  seven  possible 
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partitions  are  listed  in  Table  1.  Then  with  one  cross- 
point  and  the  hardwired,  fixed  links  as  shown  in 
Figure  13(b),  all  such  partitions  can  be  realized  under 
the  assumption  that  each  of  the  devices  can  be  placed 
in  a  subset  of  size  1  by  the  action  of  avoiding  or  ignor¬ 
ing  communication  with  other  devices  to  which  it 
might  be  hardwired  by  means  of  a  fixed  link. 

The  case  of  (Ad  .MD)- partitioning  adds  a  certain 
degree  of  complexity  to  the  requirements  to  be  satis¬ 
fied.  First  of  all,  given  N input  devices  and  M=N out¬ 
puts,  it  can  be  seen  that  the  total  number  of  partitions 
which  can  be  requested  is 

(2^-1 1  __L  (2A0! 

'  N  1  2  (Ad)2  ' 

With  Stirling's  formula,18  it  can  then  be  shown  that 

(2N~l)  ~  (22A'*  ■ 

So,  theoretically,  OiN)  crosspoints  are  required  for 
the  realization  of  an  (AiT.AfDj-partitioner. 

However,  other  than  a  complete  crossbar  network, 
the  only  construction  which  the  authors  are  aware  of 
requires  OlAflogAd  crosspoints.  As  illustrated  in 
Figure  14,  this  consists  of  an  [N,N,N)- connector, 
which  we  have  seen  can  be  realized  with  CXMogM 
crosspoints,  augmented  by  a  series  of  N—  1  cross- 
points  serially  linking  the  inputs.  The  operation  of 
this  network  in  providing  a  set  of,  say,  k  input  devices 
to  some  specified  output  is  to  simply  interconnect  a 
set  of  k  consecutive  input  devices,  and  then  provide  a 
path  through  the  (A,  A/,  Ad-  connector  from  any  one  of 
the  inputs  associated  with  this  set  to  the  specified 
output,  if  the  (N, N, AO-connector  is  a  Waksmah  net¬ 
work,21  the  resulting  (AI.MD)-partitioner  has 
4AflogA-3A+3  crosspoints.  A  similar  scheme  is 
possible  with  Benes’  3.8Aflog N  crosspoint  network. 

The  case  of  (AD,  M I)-partitioners  has  received  con¬ 
siderable  attention  in  the  literature.  Goke  and  Lipov- 
ski34  have  described  a  construction  for  such  parti- 
tioners,  which  they  have  referred  to  as  banyan  net¬ 
works.  Figure  15  shows  an  example  of  the  graph 
model  of  an  (8D,8I)-banyan  partitioner.  In  general, 
the  graph  model  of  a  banyan  network  is  a  Hasse 
diagram  of  partial  order  in  which  there  is  an  unique 
path  from  any  base  to  the  apex,  where  a  base  (apex)  is 
a  node  with  no  edges  incident  into  (out  from)  it. 

In  this  figure,  the  bottom  nodes,  which  represent 
inputs  that  are  assumed  to  be  attached  to  the  devices 
to  be  partitioned,  are  the  bases  and  the  top  nodes  are 
the  apexes.  Because  of  the  generality  of  its  definition, 

'  many  types  of  networks  are  included  in  the  class  of 
banyan  networks.  But  those  banyan  networks  which 
are  of  practical  interest  do  not  usually  have  full  parti¬ 
tioning  capabilities,  so  that  Goke  and  Lipovski  con¬ 
sidered  parallel  and  multiplexed  usage  of  banyan  net¬ 
works  to  obtain  a  full  partitioning  capability.  Some  of 
the  motivations  for  considering  banyans  are  that 
such  networks  have  (XMogAO  crosspoints  and  only 
OdogM  delay.  In  addition,  configuring  the  banyan 
network  to  satisfy  a  device  subset  interconnection  re¬ 
quest  is  straightforward  and  can  be  accomplished  in 


O(logA)  operations.  Finally,  given  an  existing  state  of 
a  banyan  network,  if  a  new  request  for  a  subset  inter¬ 
connection  is  made  by  a  subset  of  the  idle  input 
devices,  and  if  the  resulting  partition  consisting  of 
the  current  realized  subset  interconnections  plus  the 
new  subset  interconnection  request  is  realizable  by 
the  network  (recall  that  the  banyan  network  is  an  in¬ 
complete  partitioner),  then  the  banyan  can  provide 
the  new  subset  interconnection  request  without 
disturbing  the  existing  subset  interconnections. 

Recently,  Thompson13  has  suggested  the  use  of  his 
expanders  as  (AD,Al)-partitioners  with  full  parti¬ 
tioning  capability.  An  expander  is  similar  to  a  con¬ 
nector  except  that  it  has  the  added  interconnection 
capability  of  allowing  inputs  to  be  interconnected  to 
any  number  of  outputs,  while  still  requiring  that  each 
output  be  interconnected  to  at  most  one  input. 
Hence,  just  as  an  (A, A,  Ad-connector  can  realize  all  Ad 


V  DISTINGUISHABLE  OUTPUTS 


N  IDENTICAL  DEVICES  WITH  N- 1 
SWITCHES  BETWEEN  THEM 


Figure  14.  (NI.ND)  partitioner. 


Figure  15.  (8D,8l>-banyan  partitioner. 
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permutation  mappings  of  the  inputs  onto  the  inputs, 
then  an  (AWAO-expander  can  realize  all  NN  general 
mappings  of  the  inputs  to  the  outputs.  Thompson's 
design  is  to  simply  attach  the  devices  to  be  parti¬ 
tioned  to  the  outputs  of  one  of  his  expanders.  Since 
the  inputs  of  his  expander  (the  outputs  of  the  parti- 
tioner)  are  now  indistinguishable,  as  far  as  the  parti- 
turner's  operation  is  concerned,  a  part  of  the  input 
side  of  the  expander  can  be  deleted,  resulting  in  an 
(ATD,  (AT/2)I)-partitioner  with  6MogV  crosspoints. 
Slight  improvements  to  this  partitioning  scheme  are 
possible  by  using  the  W,Af,N)-Waksman  connection 
as  part  of  Thompson's  expander  or  by  using  an 
(Af.AT.AO-Benes  connector1  where  Nis  a  power  of  3  and 
three-way  branching  is  used  throughout. 

It  is  also  possible  to  use  the  W.N.M-connector 
ugu  mented  by  N—  1  serial  crosspoints  on  the  output 
side  or  full  partitioning  capability.  This  is  simply  the 
••everse  of  the  network  proposed  for  the  (ArI,ArDHpar- 


Flgurs  16.  Complete  (NO,  NI)-partltioner  with  4NlogN 
-3N+  1  crosspoints. 


Figure  17.  Schematic  view  of  an  SIMD  computer. 


titioner.  Such  an  (A'D.A'  I  l-partitioner  is  illustrated  in 
Figure  16.  Such  complete  partitioners  require 
4MogN — 3N+3  crosspoints,  but  they  do  not  provide 
the  nonblocking  property.  Again,  a  similar  scheme  is 
possible  with  Benes’  3.8Mog N  crosspoint  network. 

1 1  is  also  possible  to  construct  a  partitioner  from  an 
OiNlogN)  crosspoint  (Af, N, A^-connector  with  feed¬ 
back.54  A  clear  disadvantage  of  this  design  of  a  parti¬ 
tioner  is  that  the  paths  which  serve  as  buses  for 
subset  interconnection  iterate  around  and  through 
the  network,  introducing  significant  delays.  How¬ 
ever,  with  an  (Ar.AI.M-Cantor  network,  nonblocking 
partitioning  can  be  achieved. 

Finally,  it  should  be  clear  that  (ND.AfD)-par- 
titioners  can  be  realized  with  (M.A^A^-expanders, 
where  the  devices  to  be  partitioned  are  attached  to 
the  outputs  of  the  expander. 


SIMD  interconnectors, expanders,  sorters. . . 

There  are  still  large,  open  areas  in  our  circuit 
switching  sampler;  but  to  fill  them  in  would  go  far 
beyond  our  page  limitations.  Accordingly,  we  will 
conclude  by  mentioning  just  a  few  of  the  many  other 
possible  designs. 

For  data  exchanges  between  processing  elements 
in  classes  of  computer  systems  having  structures 
referred  to  as  single-instruction  multiple-data- stream 
architectures,  a  number  of  interesting  interconnec¬ 
tion  patterns  have  been  developed.  Figure  17  shows  a 
schematic  view  of  an  SIMD-computer.  The  intercon¬ 
nection  network  shown  is  not  necessarily  a  separate 
functional  unit  in  an  actual  SIMD  computer,  but  the 
implementation  of  interconnection  patterns  is  a  fun¬ 
damental  aspect  of  SIMD-computer  operation.  Ac¬ 
cordingly,  these  interconnection  patterns  can  be 
described  in  terms  of  single-stage  interconnection 
networks  containing  from  0(N]  to  OiNXogN)  cross- 
points.  These  networks  will  be  referred  to  as  SIMD- 
interconnectors. 

The  N  inputs  and  the  N  outputs  of  the  SIMD- 
interconnectors  can  be  considered  to  be  attached  to 
the  N  processing  elements  so  that,  say,  processing 
element  i  can  transmit  data  to  the  processing 
elements  ijt, ...  ,1^  if  input  i  has  crosspoints  between 
it  and  outputs  j\, . .  ,Jh.  Since  no  input  of  the  SIMD- 
interconnector  will  have  crosspoints  between  it  and 
all  the  outputs,  to  perform  a  data  exchange  between 
specified  pairs  of  processing  elements,  a  control  unit 
must  place  the  SIMD-interconnector  in  a  sequence  of 
states  so  that  the  desired  data  exchange  is  ac¬ 
complished  by  a  series  of  data  exchanges  through  in¬ 
termediate  processing  elements.  Clearly,  for  SIMD- 
interconnectors,  complexity  of  the  control  algorithm 
and  the  time  delay  required  for  data  exchanges  are  of 
major  significance. 

The  general  states  which  the  SIMD-interconnector 
can  assume  are  usually  described  by  functions  which 
map  the  inputs  onto  the  outputs.  However,  whether 
or  not  a  specific  data  exchange  indicated  by  a  func¬ 
tion  actually  occurs  in  system  operation  depends 
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upon  whether  the  processing  elements  involved  are  in 
active  or  inactive  states.  This  is  determined  by  a 
masking  scheme.  As  for  the  details  of  these  masking 
schemes,  suffice  it  to  say  that  from  our  perspective, 
an  inactive  processing  element  implies  that  the  cross- 
points  between  it  and  the  outputs  in  the  SIMD- 
interconnector  which  are  involved  in  the  realization 
of  some  function  are  considered  to  be  open  (that  is,  no 
data  is  transmitted),  whereas  those  crosspoints 
relative  to  an  active  processing  element  are  con¬ 
sidered  to  be  closed  (that  is,  data  is  transmitted). 
Hence,  the  complete  data  exchange  indicated  by  a 
function  does  not  occur. 

There  are  five  principal  SIMD-interconnectors 
which  have  been  discussed  in  the  literature.  These 
five  are  all  closely  related;  in  fact,  Siegel3*  has  shown 
that  they  are  all  effectively  equivalent  in  the  sense 
that  each  can  simulate  the  interconnecting  capa¬ 
bilities  of  the  others.  Since  SIMD-interconnectors 
are  discussed  in  a  companion  paper  in  this  issue  of 
Computer,  we  will  only  reference,  for  completeness, 
these  five  principal  networks  here:  the  shuffle- 
exchange  network,25-31-31-3®  which  has  2JV crosspoints; 
the  cube  network,37  which  has  N\ oglV crosspoints;  the 
Illiac  network3®-38-*1  which  has  41V  crosspoints;  the 
plus-minus  2*  network,  3®-42-43  which  has  OfMogJV) 
crosspoints;  and  the  wraparound  plus-minus  2*  net¬ 
work.  which  has  CXAlogN)  crosspoints. 

Expanders  (also  referred  to  as  generalized  connec¬ 
tion  networks)  have  already  been  mentioned  as  cir¬ 
cuit  switching  networks  which  realize  (assuming 
there  are  N  inputs  and  N outputs)  all  NN  mappings  of 
the  inputs  onto  the  outputs.  In  other  words,  one-to- 
many  assignments  are  permitted  and  the  expanders 
have  a  fanout  capability.  Such  networks  are  more 
powerful  than  connectors  which  realize  all  N\  one-to- 
one  mappings.  Indeed,  it  would  be  possible  here  to 
present  for  expanders  a  section  analogous  to  that  on 
connectors;  unfortunately,  only  the  following  very 
brief  sketch  is  possible.  Ofman14  originally  showed 
the  existence  of  a  1  OAT og/V crosspoint  expander  with 
51og/V  delay;  Pippenger  improved  this  result  by 
showing  the  existence  of  a  7.6A/loglV crossppoint  ex¬ 
pander  with  an  O(NlogN)  delay;  Thompson13  then 
showed  the  existence  of  the  same  size  network  with 
only  3.8logA7  delay.  Recently,  Pippenger,44  utilizing 
his  existence  results  for  concentrators  and  super¬ 
concentrators.  placed  a  lower  bound  of  l.9NlogN 
crosspoints  and  an  upper  bound  of  3.8A/logfV  cross- 
points  on  expanders  with  CHOogAO2)  delays.  Regar¬ 
ding  constructions,  Masson  and  Jordan45  have 
shown  explicit  designs  for  0(JV'5l'3>  crosspoint  net¬ 
works  with  only  three  delays.  Repeated  decomposi¬ 
tions  using  this  design  result  in  rearrangeable  ex¬ 
panders  with  OiN3li\ogN)  crosspoints  and  nonblock¬ 
ing  expanders  with  logA')'og6,1o®3)  crosspoints. 

Further  work  along  these  lines  has  been  done  by 
Masson,4®  Hwang,47  and  Asthana.4*  Pippenger33 
gives  a  non-uniform  rearrangeable  expander  using 
feedback.  It  is  simply  constructed  from  two 
(Af,N,AT>-connectors,  one  used  specifically  for  feed¬ 
back.  Hence,  an  interconnecting  path  involving  one 
input  and  many  (perhaps  all)  outputs  can  iterate 
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through  the  network  up  to  OtMogTV)  times.  Clearly, 
the  paths  through  such  a  network  are  non-uniform, 
but  by  the  work  of  Tsao-Wu  and  Opferman,22  a  state 
realizing  a  total  assignment  can  be  determined  with 
OlA/logA)  operations.  Finally,  Masson6  has  also 
shown  some  expander  constructions  by  using  stages 
of  the  binomial  concentrators.  Pippenger33  uses  a 
similar  idea  to  show  that  0(MlogA/)t'+1)  crosspoint 
expanders  can  be  constructed  if  0{NI\ogN)v)  cross- 
point  concentrators  are  used.  Since  a  connector  cer¬ 
tainly  operates  as  a  concentrator,  we  can  therefore 
use  this  technique  to  construct  an  CHN(\ogN)3)  cross- 
point  expander  by  using  Cantor’s  CHMlogN)2)  cross- 
point  connectors.  Similarly,  this  approach  shows 
that  if  a  state  satisfying  a  new  request  can  be  deter¬ 
mined  with  0((loglV)“)  operations  for  the  concen¬ 
trators,  then  the  resulting  expander  requires 
OfflogAO1"  1 )  operations  to  do  the  same. 

Sorting  is  such  a  basic  operation  and  so  similar  in 
nature  to  connecting,  that  as  might  be  expected,  sort¬ 
ing  networks,  or  sorters,  represent  an  area  unto 
themselves.  The  primary  difference  between  a  sorter 
and  a  connector  is  that  the  former  is  made  up  of  com- 
parater  cells,  which  for  our  purposes  can  be  con¬ 
sidered  to  be  analogous  to  (2  X  2)-networks.  Hence  the 
control  of  sorters  can  be  thought  of  as  being  dis¬ 
tributed  to  each  such  cell.  Also  like  nonblocking  con¬ 
nectors,  there  is  no  reason  not  to  expect  CXNlogN) 
crosspoint  designs;  but,  again,  the  minimal  known 
construction  of  a  sorter,  which  utilizes  an  interesting 
odd-even  merging  concept,  is  the  Batcher  sorter49 
which  has  OiMlogTV)2)  crosspoints.  However,  it  has 
been  shown4  that  the  Batcher  sorter  is  not  minimal 
for  7V=9,10,12,  and  16,  which  places  in  suspicion  the 
minimality  of  the  Batcher  sorter  for  arbitrary  N.  Mar¬ 
cus350  observes  and  discusses  the  interesting  dif¬ 
ference  in  the  OiNlogN)  growth  of  the  computation 
time  of  software  programs  that  sort  as  compared  to 
this  CHWogN)2)  growth  of  crosspoints  in  sorters,  and 
attributes  the  discrepancy  to  the  adaptive  nature  of 
the  former  as  opposed  to  the  fixed  structure  of  the  lat¬ 
ter.  Other  significant  sorter  designs  and  studies  are 
due  to  Bose  and  Nelson51  and  Van  Voorhis5265;  some 
of  Van  Voorhis’  work  also  yields  sorter  designs  with 
the  same  order  of  growth  but  yet  fewer  crosspoints 
than  the  Batcher  sorter. 

And  there  are  many  other  circuit  switching  net¬ 
works  and  issues  which  our  sampler  could  display: 
cellular  interconnection  arrays56-67;  programmable 
indexing  networks56;  seldom  blocking  networks  and 
connectors  with  finite  blocking  probability50-59;  time- 
division  circuit  switching  networks  and  time-slot  in- 
terchangers60-60"62;  and,  alas,  the  list  could  go  on  and 
on.  We  will  cease  our  presentation  here  with  the 
following  comments.  Much  of  the  work  which  now 
represents  the  state  of  the  circuit  switching  area  was 
motivated  by  general  communications-oriented  prob¬ 
lems;  indeed,  this  still  represents  much  of  the  motiva¬ 
tion  for  research  currently  in  progress.  However,  a 
new  influence  on  such  research— which  some  say  will 
eventually  be  the  influence— is  the  development  of 
LSI/VLSI  devices  to  the  point  where  very  large  net¬ 
works  of  such  devices  are  practical,  perhaps  even 


unavoidable.  Some  type  of  interconnector  chip  could 
eventually  be  as  common  in  such  system  designs  as 
processor  chips,  memory  chips,  and  I/O  chips  are  to¬ 
day.  Before  such  a  point  is  reached,  however,  many 
questions  need  to  be  addressed,  for,  clearly,  the  ap¬ 
plicability  of  a  circuit  switching  network  to  a  system 
design  is  related  to  the  details  of  the  application.  For 
example,  what  are  the  generic  circuit  switching  net¬ 
work  requirements  for,  say,  various  types  of  fault- 
tolerant  computing  designs  as  opposed  to  that  re¬ 
quired  for,  say,  various  types  of  vector  processing  ar¬ 
rays?  Some  preliminary  investigations  along  these 
lines  have  been  reported  63*8;  clearly,  however,  there 
is  a  long  way  to  go.  Moreover,  the  treatment  of  such 
questions  seems  to  demand  an  intriguing  blend  of 
computer  science  and  computer  engineering  re¬ 
search;  for  example,  complexity  theory  and  digital 
systems  structures  are  intimately  related  to  such 
considerations.  It  is  an  exciting  challenge,  and  one 
that  cannot  be  ignored.  It  is  only  necessary  to  scan 
the  modern  literature  describing  telecommunication, 
computer/peripheral,  or  instrumentation  complexes 
to  see  this,  for  inevitably  in  a  schematic  or  illustration 
there  will  be  that  operationally  significant  box  label¬ 
ed  “circuit  switching  network.”  ■ 
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ABSTRACT 

In  this  paper,  a  capacity  calculation  technique  is  described  for  a 
class  of  interconnection  networks  called  composite  concentrators.  This 
technique  is  then  used  to  develop  some  capacity  observations  for  various 
concentrators  of  this  class. 


INTRODUCTION 

A  concentrator  is  a  type  of  interconnection  network  having  switching 
elements,  called  crosspoints,  between  disjoint  sets  of  inputs  and  outputs 
to  the  network  such  that  specified  sets  of  inputs  of  some  maximum  size  can 
be  interconnected  to  arbitrary  sets  of  outputs  of  the  same  size  ID.  More 
specifically,  a  concentrator  vith  N  inputs  and  M  outputs,  N  >  M  ,  is 
said  to  have  a  capacity  c  <  M  if,  for  any  choice  of  K  c  inputs,  a  set 
of  K  disjoint  paths  from  the  inputs  can  be  established  by  closing  cross- 
points  to  some  K  outputs.  It  is  important  to  note  that  the  inputs  can  be 
specified,  but  the  outputs  to  which  they  are  connected  cannot  be  specified. 
Clearly,  if  crosspoints  were  placed  between  every  input  and  output,  the 
capacity  would  be  equal  to  the  number  of  outputs.  (Indeed,  for  such  a  net¬ 
work,  the  outputs  could  also  be  specified.)  However,  in  general ,  the  num¬ 
ber  of  crosspoints  would  be  prohibitively  high;  hence,  sparser  constructions 
are  of  interest.  The  problem  with  which  we  deal  in  this  paper  is  the  capac¬ 
ity  calculation  of  a  class  of  one-stage  sparse  crosspoint  networks  called 
composite  concentrators. 

Preliminaries 

Figure  1  shows  three  examples  of  one-stage  (or  bipartite)  sparse  cross- 
point  networks.  Such  networks  can  be  expressed  formally  as  a  triplet 
(1,0,R),  where  I  is  the  set  of  inputs,  0  is  the  set  of  outputs,  and  R 
is  a  relation  detailing  the  crosspoint  placement  between  the  inputs  and  the 
outputs.  R  can  be  given  in  the  form  of  a  graph,  or  as  a  crosspoint  model 
(as  is  done  in  Figure  1),  or  as  a  set  of  ordered  pairs  where  (i,j)cR  if 
there  is  a  crosspoint  between  input  i  and  output  j  .  Regarding  the 
latter,  we  can  express  the  network  of  Figure  1(a)  as 

({1,2. 3, 4,5},  {a ,b,c) ,  { (l,a) , <l,c) , (2,b) , <3,c) , (4 ,b) , (4, c) , (5, a) , (5,b> })  . 

Similarly,  the  networks  of  Figures  1(b)  and  1(c)  can  respectively  be 
expressed  as 

({  1,2, 3, 4, 5, 6} , {a ,b,c,d)  ,{ (l,a)  ,  (1 ,b)  , (2, a) , (2,c) , (3, a) , (3,d) , (4,b) , (4,c) , 

(5,b) , (5,d> , <6,c) , (6,d) }) 

and 

({l,2,3),{a,b,c},{ (l,a),(l,b), (2,a),(2,c), (3,b),(3,c)})  . 

It  should  be  noted  that  the  networks  of  Figures  1 (a)  and  1 (b)  satisfy 
the  implicit  requirement  for  concentrators  that  the  number  of  inputs  is 
greater  than  the  number  of  outputs  and  (since  the  capacity  must  be  greater 
than  or  equal  to  one  and  all  outputs  must  be  of  some  use  in  providing 
interconnecting  paths  from  the  inputs  through  the  network)  that  each  input 
(output)  can  potentially  be  connected  to  at  least  one  output  (input).  Mora 
formally,  we  can  express  one-stage  concentrators  as  triplets  (l,0,u)  whera 
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1 1 1 > 1 0 1  ,  and  where  u  (u"l)  is  a  relation  mapping  elements  of  1(0)  to 
subsets  of  0(1)  correspondi ng  to  the  crosspoint  placement  such  that 
u(i)/$  (u'*(o)F4)  for  all  icl  (ocO)  . 


12  3  AS 


(  C  ) 


*  )  2  3  4  5  6 

a 

b 
c 
d 


(  B  ) 

Figure  1 

It  should  also  be  pointed  out  for  later  reference  that  Figure  1(b)  is 
a  basic  binomial  concentrator  which  we  will  describe  as  a  (^ ) -concentrator 
since  the  crosspoint  pattern  connecting  the  outputs  to  the  inputs  consists 
of  all  possible  (|)  choices  of  2  crosspoints  between  the  inputs  and  the 
4  outputs.  In  general,  an  (*) -concentrator  has  (*)-inputs,  x-outputs,  y  (y) 
crosspoints.  It  should  also  be  noted  that  the  network  of  Figure  1(c)  is 
(J) -network,  but  it  is  not  a  concentrator. 

The  concentrator  of  Figure  1(a)  has  a  capacity  of  2  and  the  {^)* 
concentrator  of  Figure  1(b)  has  a  capacity  of  4  .  However,  the  capacity 
of  a  sparse  crosspoint  networks  which  satisfies  the  implicit  requirements 
for  a  concentrator  is,  in  general,  very  difficult  to  determine.  Clearly, 
one  way  to  determine  the  capacity  is  to  exhaustively  consider  subsets  of 
inputs  (starting  with  subsets  of  size  two  and  progressing  upwards)  until  a 
subset  is  found  for  which  there  is  not  a  one— to-one  mapping  of  the  inputs  to 
some  subset  of  the  outputs.  However,  even  for  networks  of  modest  size  this 
is  prohibitive.  Hence,  in  this  paper  we  will  develop  a  much  more  efficient 
capacity  calculation  technique  with  which  to  address  this  problem  for  a 
class  of  concentrators  called  composite  concentrators. 

Isolation  Concepts 

To  develop  our  capacity  calculation  scheme  it  is  necessary  that  we 
first  define  the  following  concepts. 

For  a  concentrator  expressed  as  (lr0,u),  if  u(i)c0*c0  for  some 
icl  ,  then  we  will  say  that  i  is  completely  enclosed  by  O'  ,  or  O' 
isolates  i  .  For  a  given  subset  O'cO  ,  the  set  o(O')  is: 

<7  (0  * )  -  {ijicl  and  u(i)cO')  • 

o(0#)  is  called  the  isolated  set  of  0'  ,  since  elements  of  o(O')  cannot 
access  any  output  except  those  which  are  the  elements  of  0*  . 

For  all  subsets  O'  of  0  for  which  )o*)=k  ,  we  define  an  isolation 


1017 


number  to  be: 


Hr  =  max  | 0(0*) |  . 

|0<j=k  and  O'cO 

Finally,  the  isolation  vector,  • • • ,h i  i )  ,  is  determined  by  taking 

the  differences  of  the  isolation  numbers  as*  ' follows: 

where  l<j<|o|  and  H  =0  . 

3  3  3-1  1  1  0 

k  jo| 

Obviously  J  hj=Hk  *  and  J  hj=H|o|=hl  * 

Example  1:  Consider  the  concentrator  of  Figure  1(a).  By  inspecting  the 

relation  u  it  is  seen  that  {a,b}  isolates  (2,5)  ,  because  u (2)={b)c{a,b). 
and  u (5)={a,b}c{a,b}  . 

Since  |u(2) j= |u(3) |=1  but  u(2)^u(3)  and  |u(j) |>1  for  je{l,4,5)  ,  it 
follows  that  H^=l  .  Similarly  H2=3  ,  and  1*3=5=  1 1 1  .  Hence  the  isolation 
vector  of  this  concentrator  is  (h^,h2,h^)- (1,2,2)  . 

Capacity  Calculation 

We  can  now  give  the  following: 


Theorem  1 s  The  capacity  of  a  concentrator  is  the  minimum  value  of  c 
which  satisfies 

c 

H  =  7  h. >c  , 

C  i=l  1 

where  (h^ ,h2 , . . . ,h | Q | )  is  the  isolation  vector  of  the  concentrator. 

Proof :  If  c  is  the  minimum  value  which  satisfies  Hc>c  ,  then  any  H. 

for  l<j <c  is  less  than  or  equal  to  j  ,  that  is  ,  H .  for  2<j<c  .  3 

Therefore  any  subset  I*  of  I  ,  where  | I ■ | — j<c  ,  haS  crosspoints  to  a 
subset  0'=^Uj ,u(i)<0  and  |o*|>jj  •  Thus  from  Hall's  Theorem  (see  Appendix), 

any  subset  Ifci  of  size  up  to  and  including  c-1  has  some  subset  0'£0 
where  there  is  a  one-to-one  correspondence  between  I*  and  O'  .  Now  from 
the  requirement  on  the  concentrator  that  for  any  iel  ,  u(i)/$  ,  it  follows 
that  for  the  given  subset  O'  ,  every  element  ijfO(0*)  is  connected  at 
least  one  element  in  0  which  is  not  in  0*  ;  that  is, 

for  all  i^o(O')  ,  u (i) n (0-0' )?$  . 


Therefore  for  any  subset  I 'ci  up  to  and  including  size  c, 
subset  of  0'c0  which  has  one-to-one  correspondence  with 
capacity  is  c  . 


there  is  some 
I '  .  Hence  the 

Q.E.D. 


Example  2:  The  isolation  vector  of  the  concentrator  of  Figure  1(a)  is 

(1,2,2). 

H1=h1  =  1i1  ^1*  H2=h1+h2=l  +  2=3>2  . 


Therefore  the  capacity  of  the  concentrator  is  2  . 


By  making  use  of  Theorem  1  it  is  clearly  straightforward  to  calculate  the 
capacity  of  a  concentrator  when  the  isolation  vector  or  the  values  of  the 
isolation  numbers  Hj,l<j<Jo|  ,  are  known.  But,  in  general,  this  informa¬ 
tion  is  not  easy  to  obtain.  However,  for  some  special  cases,  such  as  bi¬ 
nomial  concentrators,  we  can  give  the  following. 

Lemma  1:  The  isolation  vector  of  a  (*) -concentrator  is 


“y-l’-'y-l’ 


Vi  * 


x-1 

y-1 


))  . 
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Proof :  An  (*) -concentrator  has  (*)  input  lines,  while  an  (*y  )- 

concentrator  Iwhich  is  contained  in  an  (S) -concentrator)  has  one  less  output 
line  than  the  (*) -concentrator  and  only  1x~i)  input  lines.  Therefore  at 
nost  (*y^>  input  lines  are  isolated  by  x-1  output  lines.  That  is  Hx_j  = 
(’‘y1)  •  Now  from  the  formula  (g)  =  (“j1)*  (§li)  ,  we  have  that 

vvvr‘y,)'tVl=(HJ  • 

By  using  this  equation  recursively  the  isolation  vector  is  obtained  in 
descending  order. 

Q.E.D. 


With  Lenxaa  1,  we  can  prove  the  following. 


Theorem  2  (Masson  12]):  The  capacity  of  (*) -concentrator  is  y+2  . 

Proof :  Since  in  a  concentrator,  the  number  of  inputs  is  always  larger  than 

the  number  of  outputs,  it  follows  that  ac>y+2  • 


From  Lesma  1 

and 

Note  that 


Vi=‘yy1>=y+1  * 
V2=<yy 2,=*r  ty+2)  ’ 
Vi=y+1<y+2<*T(y+2>=V2 


Hence,  the  capacity  is  y+2  . 


Q.E.D. 


Composite  Concentrators 


Given  two  one-stage  sparse  crosspoint  networks,  say,  A:(Iy,0ltUy) 
and  B: O2.02»u2)  ,  we  define  the  composite  one-stage  sparse  crosspoint  as 
A-B:  (11xI2,01x02,ux2)  or  more  simply  Ul2'°12>u12>  where  I12  is  the 

Cartesian  product  of  Zy  and  I2  and  0j2  is  the  Cartesian  product  of 
°1  and  02  ,  and  where  u12:  l1xl2*01x02  such  that 


U12(ili2Mul,il)XU2U2,) 


where 

u,(i,) 


ilcll 

and 


and 

u2(i2) 


i2cl2 


and  ui (i2)xu2(i2)  is  the  Cartesian  product  of 
The  networks  A  and  B  will  be  referred  to  as  the 


component  networks  of  the  composite  network  A • B 


Example  3:  Let  A  denote  the  network  of  Figure  1 (a)  and  let  B  denote 
the  network  of  Figure  1(c). 

A:  ({1,2,3, 4, 5},  {a,b,c),  u^) 

B:  ({1,2,3),  {a,b,c} ,  u2) 


3 

- a 

Hr -  b 


Then  the  composite  network  A-B  has  15  inputs  and  9  outputs  and  a  compos¬ 
ite  relation  u12 

A-B:  ({(1,1), (1,2), (1,3), (2,1), (2, 2), (2, 3), (3,1), (3, 2), (3, 3), (4,1), <4, 2), 

(4,3)  , (5,1) , (5,2) , (5,3) ) ,{ (a, a) , (a,b) , (a,c) , (b,a) , (b,b) ,  (b,c) ,  (c,a) 
(c,b) , (c,c) )*u12> 
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11  12  13  21  22  23  31  32  33  41  42  43  51  52  53 


Note  that  A*B  is  a  composite  concentrator.  Hence  if  we  knew  the  iso-* 
lation  vector  of  this  composite  concentrator,  by  Theorem  1  we  could  deter¬ 
mine  the  capacity.  However  even  though  the  isolation  vectors  of  the  com¬ 
ponents  are  known,  the  isolation  vector  of  the  components  is  not  apparent. 
Hence,  we  will  now  define  the  concept  of  an  isolation  array  with  which  to 
address  this  problem. 

In  general,  suppose  that  two  networks  P  and  Q  have  following  isolation 
vectors:  <Pi*P2' - • * #Pn)  and  ^1*^2' •• •  Th«n  the  isolation  array  for 
the  composite  network  P*Q  is  an  mxn  array  whose  i^  row,  column  ele¬ 
ment  is  equal  to  Pj^j  •  In  other  words,  it  is  the  following  vector  product 
of  isolation  vectors: 


(0,1,2) 


For  later  reference,  we  will  denote  the  i^  row,  column  element  of 


the  isolation  array  for  the  composite  network  P*Q  as  h^j  .  Then  if  P 
has  m  outputs  and  Q  has  n  outputs,  the  isolation  array  is 
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Capacity  of  Composite  Concentrators 

We  will  present  in  this  section  a  theorem  which  is  the  basis  of  a  ca¬ 
pacity  calculation  technique  for  a  class  of  composite  concentrators.  How¬ 
ever,  before  doing  this,  we  must  first  develop  the  concept  of  a  smooth  net¬ 
work. 


,  We  will  define  a  network,  say,  P:  (1,0, u)  to  be  smooth  if,  given  the 
network's  isolation  numbers  .Hj, . . . ,H| ,  there  exists  a  sequence  of 
| 0 |  subsets  of  0  ,  denoted  as  01( 02, . ; . Jo | Q j  ,  where 


and 


vw 

|  0(0.)  I  =  H. 
3  1  3 


Example  5:  The  concentrator  of  Figure  1(a)  (for  which  H  =1,H  =3,  and  H=5) 

is  smooth  as  can  be  seen  by  examining  0  ={b],  0  ={b,c)  ,  and  20  ={a,b,cl 
since  o((b})={2},  o <{b,c ))={ 2, 3, 4} ,o <{a,b,c) )  =  {l, 2, 3,4, 5}  .  3 


Finally,  we  will  define  an  m(n)=partition  of  an  integer  c  to  be  a 
partition  of  c  into  m  integer  parts,  r j.rj, . . . ,rm,  such  that 
ri+r2+*  - -+rro=c,  0<r j^n,  and  ri>rj  for  i<j  .  We  can  now  state  the  followong 

Theorem  3;  Given  the  isolation  array  of  the  composite  network  P*Q  where 
the  isolation  vector  of  P  is  (p2,...,p]n)  and  the  isolation  vector  of  Q 
is  and  at  least  one  of  these  two  component  networks  is  smooth, 

then  the  capacity  of  P-Q  is  given  by  the  minimum  value  of  c  which  satisfies 
max  rl  r2  rm 


over  all  m(n) -partitions  (  [ 


of  c 


3=1 


1. 

3 


I 


3  =  1 


^  l  h )>c 
.  ,  m . 

3  =  1  3 


The  proof  of  Theorem  3  will  be  omitted  in  this  version  of  this  paper 
because  of  its  length  (3J. 


It  should  be  noted  that  the  complexity  of  calculating  the  capacity  of 
a  P-Q  composite  concentrator  by  repeatedly  using  Theorem  3  for  increasing 
values  of  c  can  be  shown  to  be  0(N3^2)  (5). 


Example  6:  For  the  composite  concentrator  A*B  of  Example  3,  the  3(3)- 

partition  which  satisfies  Theorem  3  is:  3,3,0  .  Hence,  for  the  composite 
concentrator  A*B,  we  have  that  c=6  .  It  should  also  be  noted  that  using 
Theorem  3  repeatedly  by  starting  with  c=l  and  programming  to  c=6  results 
in  a  consideration  of  15  partitions. 


Capacity  Observations 

Using  Theorem  3,  a  wide  variety  of  interesting  capacity  observations 
can  be  made.  For  example,  a  P*Q  composite  concentrator  where  P  is  a 
(“) -concentrator  and  Q  is  a  (*) -concentrator  is  called  a  (“) (y)" 
concentrator.  Formulas  for  the  capacity  in  terms  of  the  parameters  u,v,x, 
and  y  for  all  cases  of  interest  are  listed  in  Table  1  and  shown  graphically 
in  Figures  2(a)-2(d). 


Higher  dimensional  composite  concentrators  can  also  be  considered.  For 
example,  we  can  determine  the  capacity  of  a  (2) (2) (2) -concentrator,  or  more 
simply,  a  (J) ^-concentrator  using  a  3-dimensional  isolation  matrix.  Simi¬ 
larly  (2) "-concentrators  can  be  considered.  Table  2  gives  the  capac 
(J) "-concentrators  for  l<n<8  ,  x>4  . 


capacity  of 


Finally,  Figure  3  shows  a  3  dimensional  isolation  array  for  (2) (2) (J)- 
concentrators.  Some  rather  interesting  capacity  observations  for  this  3 
dimensional  composite  concentrator  are  the  following:  for  u>4,  w=4,  and 
y=5,  c=79;  for  u>2,  w»5,  and  y=5,  c=74:  for  u>2,  u>6,  and  y»5,  c*58;  and 
for  uj^2,  w>4,  and  y>6,  c=48. 
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U  X 

(  )  (  )  -  Concentrator 
v  y 


2«v±y  and  (“><*)> ux 


Case  X.  5 

Case  1.1. 

Case  1.'2. 

Case  1.2.1. 

Case  1.2.2. 

Case  2.  i 

Case  2.1. 

Case  2.1.1. 

Case  2.1.2. 

Case  2.2. 

Case  2.2.1. 

Case  2.2.2. 

Case  2. 2. 2.1. 
Case  2. 2. 2. 2. 
Case  2.2.3. 

Case  2. 2. 3.1. 
Case  2. 2. 3.2. 
Case  2.2.4. 

Case  2. 2. 4.1. 
Case  2. 2. 4. 2. 


y=2 

u*fv  and  x¥y 
u=v  or  x=y 


c=(v+l)  (y+2)-l 


u**v 

c=v (y+4 ) 

x=y 

c= (v+4 ) y 

fc2 

v*y 

u^v+2 

c= (v+1) (y+2)-X 

u>v+2 

c=v ly+3) 

v%y 

where  u  is 

x=y 

c=uy  .  . 

minimum  anc 

x=y+l 

2y>v(v+l) 

c=  (v+2)  (y+1) 

2y^v (v+1) 

c= (v+2) (y+l)-l 

x=y+2 

2v>y 

C=(v+1)  (y+2)-l 

2v^y 

c=v (y+2) 

x^y+3 

2v>y 

c=v(y+3) 

2v^y 

Table  1 

c=v(y+2) 

C;»vy 


Concentrator  2^v^y  and 

0 (y>>ux 

y=2 

I 

Case  1.2.2.  x=y 

12 

11 

(v+4)y 

12 

11 

Case  1.1.  u*v 

and  xky 

11 

(v+1) (y+2)-l 

11 

11  11 

11  11  11  11  11  - 

.  . 

12  12  12  -  -  -  Case  1.2.1 


(2)(2,l2,(2H2,l2,9 


Figure  2 


<y>  (y;1,<y;2)‘y;3,(yy  ) 


Figure  2  (continued) 


n 

capacity 


*  Table  2 
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APPENDIX 

Hall's  Theorem  [4): 

Any  set  of  inputs  I'  ,  I'ci  ,  has  a  one-to-one  corresponsence  with 
some  output  set  0*£0  if  and  only  if  for  any  subset  I"£l '  we  have  that 

|u<I«)|>Jl«|  . 
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Abstract 

A  concentrator  is  a  type  of  interconnection 
network  which  can  provide  disjoint  paths  by  means 
of  closing  switches  (usually  called  crosspoints) 
from  any  specified  set  of  inputs  to  the  network 
(where  this  input  set  has  cardinality  less  than  or 
equal  to  some  number  called  the  capacity)  to  some 
arbitrary  set  of  outputs  of  the  same  size.  In 
this  paper,  we  consider  the  calculation  of  the 
capacity  of  a  class  of  concentrators  called  com¬ 
posite  concentrators.  Composite  concentrators  are 
one-stage  interconnection  networks  with  sparse 
crosspoint  patterns  between  the  inputs  and  outputs 
where  this  total  crosspoint  pattern  is  determined 
from  the  crosspoint  patterns  of  smaller,  component 
concentrators  which  have  the  so-called  binomial 
crosspoint  pattern. 


Introduction 

A  concentrator  is  a  type  of  interconnection 
network  which  can  provide  disjoint  paths  from  any 
specified  set  of  inputs  to  the  network  (where  this 
input  set  has  cardinality  less  than  or  equal  to 
some  number  called  the  capacity)  to  some  arbitrary 
set  of  outputs  of  the  same  size.  Ti  e  crucial 
limitation  of  this  type  of  interco.'-.t^ction  network 
is  that  the  output  set  to  which  the  specified  in¬ 
puts  can  be  connected  cannot  be  arbitrarily  speci¬ 
fied.  Indeed,  the  assignment  of  a  given  input  to 
a  particular  output  is  often  not  possible.  For 
example  Figure  1  shows  a  design  of  one-stage,  six 
input,  four  output  concentrator.  In  this  design 
an  ”x"  represents  a  crosspoint  between  input  and 
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FIGURE  1 


output  lines  indicating  that  there  can  be  a  con¬ 
nection  between  the  lines.  Note  that  the  input 
set  (1,2)  can  be  connected  to  any  one  of  the  out¬ 
put  sets  { a , b }  ,  (a,c)  ,  or  {b,c}  .  But  it  is 
clear  that  there  is  no  means  of  connecting  (1,2) 
to,  say,  (a,d)  .  Hence,  there  is  limited  access  to 
the  outputs  from  the  inputs.  Nevertheless,  it  will 
be  shown  later  that  this  concentrator,  called  a 
binomial  (^ ) -concentrator  (since  the  network's 
column  crosspoint  oattern  displays  exactly  once 
every  possible  combination  of  placing  two  cross- 
points  on  the  four  rows),  can  connect  any  four  (or 
fewer)  inputs  to  some  output  set  of  the  same  size. 


Hence,  this  network  has  a  capacity  of  4. 

Sparse  crosspoint  placement  in  a  one-stage  net¬ 
work  to  provide  a  maximum  capacity  is  the  main  topic 
of  this  paper.  More  particularly,  we  will  consider 
an  extension  of  the  binomial  design  method  for  con¬ 
centrators.  The  binomial  (“) -concentrator  and  its 
associated  capacity  were  first  discussed  in  (1]  and 
12].  An  extension  of  this  design,  the  composite 
binomial  concentrator,  was  discussed  in  [2]  and  13], 
In  this  paper,  a  further  extension  of  this  design, 
called  a  higher  order  binomial  composite  concentra¬ 
tor,  is  considered. 

Preliminaries 

A  one-stage  concentrator  can  be  defined  as  a 
triplet  (l,0,r),  where  I  is  the  input  set,  0  is 
the  output  set,  and  r  is  the  crosspoint  placement 
relation  between  I  and  0 .  For  the  example  of 
Figure  1,  I  »{ 1,2, 3,4, 5,6}  ,0«(a,b,c,d}  ,  and 
r  -  {  (l,a)  ,  (l,b)  ,  (2, a)  ,  (2,c)  ,  (3, a),  <3,d)  ,  <4,b)  , 

(4,c)  ,  (S,b)  ,  (5,d)  ,  (6,c)  ,  <6,d)j  .  r  can  be 
expressed  by  a  graph  as  well  as  by  a  crosspoint 
diagram  as  shown  in  Figure  1.  When  only  a  conveni¬ 
ent  summary  of  the  properties  of  a  concentrator  need 
be  expressed,  another  triplet  of  numbers  (|lj,|o|,c) 
where  | X (  is  the  number  of  inputs,  j 0 )  is  the 
number  of  outputs,  and  c  is  the  capacity  of  the 
concentrator  can  be  used.  Recall  that  the  network 
has  capacity  c  if  any  k  inputs,  k  s_c  ,  can  be 
connected  to  some  k  outputs  using  k  ~of  the 
crosspoints.  For  the  concentrator  of  Figure  1, 
this  triplet  is  (6,4,4)  . 

A  binomial  (“) -concentrator  is  a  one-stage 
concentrator  having  (“)  inputs,  u  outputs,  and 
where  each  of  the  (u)  inputs  has  r  crosspoints 
to  a  unique  choice  of  v  of  the  u  outputs. 

A  composite  concentrator  q*  C2  *  (Ij,xl2  > 

0jx02  >  r12^  of  two  component  networks  C  •> 

(Il'°l’rl*  and  C2  "  lI2,02'r2*  is  a  on^_sta9e 
concentrator  where  IjXl2  and  0**  02  are 

Cartesian  products,  and  r^2  is  the  crosspoint 
placement  relation  between  I.xl,  and  O.xO,  which 
is  defined  as  ‘ 

rl2lili2)  "  <rl(il)  x  r2U2))  ' 

for  i^  c 1^  and  i2  £  Ij  .  Figure  2  shows  a 
(|) (|) -composite  concentrator  which  was  composed 
from  a  (|) -concentrator  and  a  (J) -concentrator . 

Higher  order  concentrators,  c1xC2*...«Cn  ,  are 
defined  in  a  similar  manner. 

Isolation  Arrays 

For  an  input  i  c I  of  a  single  concentrator 
(l,0,r)  ,r(i)  is  the  set  of  outputs  to  which  i 
has  crosspoints  and  is  therefore  capable  of  being 
connected.  Hence,  without  all  the  outputs  of  r(i), 
the  input  i  is  isolated  from  the  outputs.  looking 
at  this  from  the  output  side,  for  any  subset  O’  of 
the  outputs  0  ,  the  issue  is  what  set  of  inputs  are 
isolated  by  O’  .  We  will  express  such  a  subset  of 
inputs  as  o(0* )  •  Hence  o(O')  is 

ol0')»(i|icl  and  r(i)5_0'}  . 
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<2>  (f) “composite  concentrator 


FIGURE  2 


Now  for  all  subsets  of  outputs  of  size  k  ,  we 
define  the  isolation  number  as: 


max  l«(0*)|  . 

k  O'  co  and  ) O’  (  •=  k 


h  ■  y  h  .  >  c , 

C  >,  1 

1*1 


where  (hi,h2,....hj0| )  is  the  isolation  vector  of 
the  concentrator. 


That  is,  the  isolation  number  is  the  maximum  size 
of  all  the  isolation  sets  which  are  isolated  by 
any  output  set  of  size  k  .  With  isolation  numbers, 
we  can  define  an  isolation  vector  as: 


S  *  lVh2 . h|0|  ’ 

h.  *  H.-H.  ,  for  1  <j -c ( 0 {  and  H 
1  1  1-1  1  1  0 


A  higher  order  isolation  array  for  composite 
concentrators  is  defined  by  the  array  multiplica¬ 
tion  of  isolation  vectors.  To  begin,  the  two  di¬ 
mensional  isolation  array  of  two  isolation  vectors, 
say,  and  Sj  is  obtained  from  ordinary  vector 
multiplication.  More  generally,  the  elements  of  a 
m*-*1  order  isolation  array  of  m  vectors 
®1  *  ®2  *  ..*  are  defined  by  products  such  as: 


h. 


,1  l2  i  .m 

n  .  n ,  •••!).  •  •  «n . 

31  32  ’i  3m 

th 


where  h.  is  the  j.  element  of  the  S  iso- 
a  i 

lation  vector. 


Capacity  Calculation 

The  capacity  of  a  concentrator  can  be  calcu¬ 
lated  by  means  of  its  isolation  vector. 

Theorem  1  ( 3] :  The  capacity  of  a  concentrator  is 
the  minimum  value  of  c  which  satisfies 


For  a  binomial  (“) -concentrator,  the  isolation 
vector  can  be  shown  to  be  [3J : 


“$-X> 


(i-i> 


This  leads  simply  to: 

Theorem  2  12) ,  [3) :  The  capacity  of  (^-concentra¬ 
tor  is  v  +  2  . 


This  can  be  extended  to  second  order,  binomi¬ 
al,  (“) (J) -concentrators  as  follows: 

Theorem  3 :  The  capacity  of  a  binomial  (^)  (*[)- 
concentrator  is  the  minimum  value  of  c  which  sat¬ 
isfies 

c 

J  h.  .  >  c  , 

L  13 


where  the  summation  is  over  c  of  the  hjj ’s  , 
where  if  hjj  is  in  the  summation,  then  all 


h^'s  ,  K^i  and  l  <_j  ,  are  also  in  the  summation. 

For  the  special  class  of  a  second  order  bi- 
nomial  (VJ2) (v*2) -concentrator  or,  more  simply,  a 
(Vy  ) ^-concentrator  the  isolation  array  is  the  two 
dimensional  array  shown  in  Figure  3.  Now,  the  sum¬ 
mation  of  h,i  ' s  up  to  and  including  h,  ,  ,, 

J  (v-fl),  (v+1) 

satisfying  the  ordering  required  by  Theorem  3  yields 
a  sum  which  is  equal  to  the  number  of  elements 
added.  That  is. 


Theorem  5 :  The  capacity  c  of  a  n^  order  (Jj)n- 
concentrator  is  bounded  as  follows: 

(v+l)n  <  c  <_v*  (v+1) n  3(v+2)  for  u=v+2  and  n^3  , 
(v+l)n  <  c  <v-  (v+l)n_2 (v+3)  for  u^y+3  and  n  >_2  . 
Proof:  Part  1:  (v+l)n<C 

For  the  two  dimensional  isolation  array  case, 
the  maximum  sum  of  (v+1) 2  elements  satisfying  the 
ordering  required  by  Theorem  4  can  be  shown  to  be 

v+1  v+1  , 

I  l  h  *  (v+D2  • 

i=l  j«l  3 


1  +  v  +  h  +  h,  +  . . ,  +  h  .  <  v3  +  iv3  2 
2  3  l  — 


Noting  tnat 


1  +  v  +  h2  +h3  +  ...  +hi  -  (V*1)  , 


(V+i)  .  (v+1 ) (v+2) . . . (v+i)  <  i  +  i-1 

v  il  ' 

We  can  conclude  that  there  are  not  m<  (v+l)n  ele¬ 
ments  of  the  isolation  array  that  will  sum  to  more 
than  m  .  Hence  (v+l)n  <  c  . 


This  generalises  to  the  n  order  isolation 
array  case  where  it  can  be  shown  that  the  maximum 
sum  of  lv+l)n  elements  is 


c  «.v*  (V+I)"'3 

(v+2)2 

for 

u*v+2 

and 

n^3 

(a) 

and 

c  <^v*  (v+l)n 

(v+3) 

for 

u>v+3 

and 

n  >_2 

(b) 

v+i  v+1  v+i 

l  l  ***  l  h+  +  a  - 

3A-l  i2*l  Jn-1  3lV*°n 


(v+l)n  . 


Now,  this  part  ol  the  proof  will  be  completed  if 
it  can  be  shown  that  there  is  no  way  of  choosing 
m  <  (v+l)n  elements  which  sum  to  more  than  m  . 

To  show  this,  it  will  first  be  convenient  to  sim¬ 
plify  the  notation.  We  will  therefore  write  the 
isolation  vector 

((v°1) . <£!>.  <£!)•  (v-i1,  Ci’*  ‘v-i1"**' 

(V^IJ).(V^1)....)  . 

more  simply  as 

(Oy .» »» Of l,v,h.|h.i « ,h  sih  •••)  • 

21  n-l  n 

Using  this,  the  two  dimensional  isolation  array  can 
be  given  as  shown  in  Figure  5. 

- - (v+2) — - •* 

t  |  0,...,0,  o,  0,  0,  0,....  o»... 

!  I  .  •  .  •  • 

I  >  :  :  :  :  :  : 

v  .  0  ,  ...  ,  0  ,  0,  o,  0,  0,....  0,..- 

I  0  ,  ....  0  ,  1,  V,  hj,  h3  '  '  h„»  ••• 


,  0  ,  v 


vhyr  vh3, 


0,  h2,  vh2,  h^ ,  h2h3,  ....h^,. 


1  +  v  +  hj  <  v(v+2)  «v  +  2v  . 
Similarly,  for  higher  order  cases. 


These  two  inequalities  can  be  proven  by  show¬ 
ing  that  there  always  exists  a  set  of  m  elements 
which  sums  to  more  than  m  when  m  is  equal  to 
v.  (v+l)n_2(v+2)2  or  v. (v+1) n"2 (v+3)  respectively. 
Considering  (b)  of  Part  2  first,  we  have  previously 
shown  that  in  the  two-dimensional  case  that  there 
exists  v* (v+3)  elements  satisfying  the  subscript 
ordering  requirement  that  sum  to  more  than  v«(v+3). 
Now,  to  go  to  higher  dimensional  isolation  arrays, 
consider  the  first  v+1  elements  of  the  isolation 
vector 


(0,0, 


•  •  ,  0  ,  1  ,  v  , 
v+1  - 


0,  hn,  vh„,  h2hn,  h3hn,  . . .  ,  hn,  • 


Figure  5 

Clearly  (and  as  indicated  in  Figure  5),  the  minimum 
number  of  elements  that  must  be  summed  to  include 
h2  in  the  summation  is 

2  2 
v-  (v+2)  «  v  +  2v  <  (v+1)  , 

Moreover,  this  is  the  only  possible  selection  of 
less  than  (v+1)2  elements  that  sums  up  to  more 
than  (v+1)  .  Hence,  so  as  not  to  exceed  v • (v+2)  , 


These  first  v+1  elements  clearly  sum  to  v+1  so 
as  we  expand  to  higher  dimensional  arrays  (h  ^>3)  , 
it  is  clear  that  we.  can  choose  for  each  additional 
dimention  (v+1)  times  the  number  of  elements  of 
the  previous  dimension  which  sum  to  (v+1)  times 
the  previous  sum.  In  other  words,  for  the  nt3*  order 
case,  if  we  select  v. (v+1) n-2 (v+3)  elements, 
h.  .  ,  where  1<1 ,<v  ,  lfj  <(v+3)  ,  and 

3132* ‘ ’3n  1_  2~ 

ly^lv+1)  ,  3^k^n  ,  it  follows  that  they  will  sum 

to  more  than  v- (v+1) n“2 (v+3>  . 

Considering  next  (a)  of  Part  2,  it  is  easily 
seen  that  for  the  three  dimensional  case,  the  ele¬ 
ments  h .  .  ,  l<3,<v  ,  l<j  <(v+2)  ,  k  «  2, 3  ,  sum 

313233 

to  more  than  v(v+2)2  .  Then,  similar  to  the  argu¬ 
ment  for  (b) ,  expanding  to  higher  dimensions  will 
maintain  this  unequality  since  the  number  of  ele¬ 
ments  and  the  sum  increase  by  a  factor  of  (v+1) 
for  each  new  dimension.  In  other  words,  for  the  n^ 
order  case,  if  we  select  v  *  (v+1) n“3 (v+2) 2  ele¬ 
ments,  h.  .  ,  ,  where  l<j  <v  ,  l<j  <(v+2)  , 

3132* ’ "3n  *“ 

k«2,3  ,  and  l^j  ^<v+ 1  ,  4<i<n  ,  it  follows  that 

they  will  sum  to  more  than  v  (v+l)n"2(v+2)2  . 

Q.E.D. 

Beyond  the  capacity  bobnds  of  Theorem  5  for 
(y)  -concentrators,  an  essentially  similar  argument 
to  that  of  Theorem  4  yields  the  exact  capacity  for 
sufficiently  large  n  .  This  result  is  stated  with¬ 
out  proof  as 

Theorem  6:  The  capacity  of  (}*)  n-concentrator  for 
sufficiently  large  n  is 

v  (v+l)  n~3  (v+2)2  -  (n„2)+4  for  o-v+2 
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2  2 
1 +V+V+V  ■  (v+l) 


and  this  is  the  maximum  value  of  any  summation  of 
(v+1)^  terms  in  the  allowable  adding  order.  The 

next  nonzero  element,  h  .  ,  is  v  elements 

v, (v+2) 

away  with  a  separation  of  v-1  zeros.  But 


v, (v+2) 


v  (v+l) 
2 


v+l  v+l 

l  l  *i3 
i*l  j-1  3 


(v+l)2  <  (v+l)2  +  v»  (v+l)  (v+2)-l 

<tv+1)2+ii|lil  . 


Therefore  from  Theorem  3,  the  capacity  of  the  (Vy2)- 
concentrator  is  (v+l) (v+2)-l.  Similarly,  the 
isolation  array  for  the  lvJ3) 2-concentrator  is 
shown  in  Figure  4.  The  maximum  summation  of 
v(v+3)  elements  of  the  isolation  array  satisfy¬ 
ing  tlje  required  ordering  is 


v  v+3 

I  I  hii  '  3+v+ 

i-1  j-1  3 

While  for  v  >  3 


v(v+i)  v(v+l)  (v+2) 


>  v(v+3)  . 


V  VT  J 

v(v+3)  >  jv(v+l)  +1-^  h..  -hVf(v+3,  • 

Therefore  from  Theorem  3,  for  v>  3,  the  capacity 
of  (VJ3) 2-concentrator  is  v(v+37  . 


Since  it  is  easily  seen  that  any  l“) 2-concen¬ 
trator  with  u  ^v  +  3  has  an  isolation  array  in 
which  the  isolation  array  of  (v*3) 2-concentrator  is 
embedded,  it  can  be  concluded  that  the  capacity  of 
any  (}J)  2-concentrator  with  v^3  and  u  >»  +  3  is 
v  (v+3)  . 

Extensions  to  Higher  Orders 

Theorem  3  can  be  extended  to  the  n^h  order  bi¬ 
nomial  concentrator  case  as  follows. 

Theorem  4:  The  capacity  of  an  nt3)  order  binomial 

(“1) (“?) . . . (un) -concentrator  is  the  minimum  value 
vi  V2  vn 

of  c  which  satisfies 


)  h.  .  .  >  c  , 

3l32*"3n 

where  the  summation  is  over  c  of  the  h.  .  's 

...  3 13  2’ *  *3n 

where  if  h.  .  .  is  in  the  summation,  then 

Is.  js  -  t  .  .X 

x  2  n 

all  h.  's  ,  l<i<n  k.  >  l.  ,  are  also  in 

x  2  n 
the  summation. 

In  general  the  number  of  summations  of  h's 
which  satisfy  the  conditions  of  Theorem  4  is  pro¬ 
portional  to  an  exponential  function  of  c  .  There¬ 
fore,  exhaustively  examining  all  of  them  to  find 
the  minimal  is  impractical.  Fortunately,  for  bi¬ 
nomial  (^)n-concentrators,  capacity  formulas  for 
sufficiently  large  n  can  be  obtained. 


Third  Order  Concentrators 

The  capacity  of  the  general  class  of  (“) (“) 

(5j) -concentrators  is  worth  considering  as  it  dem¬ 
onstrates  how  the  capacities  of  composite  concen¬ 
trators  vary  according  to  the  parameters  of  the 
component  concentrators  (which  in  this  case  are 
u,v,w,x,y,  and  z).  Accordingly,  Figure  5  gives  the 
capacity  of  some  (^) (") <J) -concentrators.  Each  of 
these  capacities  was  calculated  with  a  program 
that  examines  O(vxz^)  summations  of  elements  of 
the  three  dimensional  isolation  array  (all  of  which 
satisfy,  of  course,  the  subscript  ordering  require¬ 
ment  of  Theorem  4). 
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ABSTRACT 

An  (N,M,C) -concentrator  is  an  interconnection 
network  with  N  inputs,  M  outputs  (M^N)  where 
for  any  specified  subset  of  C<_M  inputs,  each  of 
the  inputs  can  simultaneously  be  connected  by  means 
of  a  disjoint  path  to  some  output.  In  this  paper 
we  consider  concentrator  design  from  the  point  of 
view  of  (i)  lower  bounds  on  the  number  of  switch¬ 
ing  elements  called  crosspoints  needed  to  imple¬ 
ment  a  one-stage  concentrator,  and  tii)  the  place¬ 
ment  of  these  crosspoints  on  a  crossbar  grid  to 
produce  sparse  crossbar  networks  which  function  as 
concentrators.  More  particularly,  we  show  minimal 
crosspoint  designs  for  the  special  full  capacity 
cases  where  M « C  .  With  a  new  lower  crosspoint 
bound  that  we  derive  for  general  N,  M,  and  C,  we 
show  that  the  so-called  binomial  network  is  a  mini¬ 
mal  concentrator  design,  and  we  consider  the 
near-minimality  of  composite  binomial  designs. 
Finally,  we  use  our  derived  lower  bound  to  show 
that  Pippenger's  construction  for  a  N  input- 
N  output  superconcentrator,  which  recursively  uses 
concentrators  as  network  components,  requires  at 
least  17.0625N  crosspoints. 

INTRODUCTION 

An  (N,M,C) -concentrator  network  is  an  inter¬ 
connection  network  with  N  inputs  and  M  outputs 
(M<^N)  where  for  any  specified  subset  of 
inputs,  each  of  the  inputs  can  simultaneously  be 
connected  by  means  of  a  disjoint  path  to  some  out¬ 
put.  C  is  referred  to  as  the  capacity  of  the 
concentrator  [11- (6).  The  crucial  limitation  of 
the  connecting  capability  of  this  type  of  intercon¬ 
nection  network  is  that  given  a  specified  set  of 
inputs,  in  order  to  simultaneously  connect  each 
input  in  this  set  of  an  output,  the  outputs  cannot 
in  general  be  arbitrarily  specified  (as  would  be 
the  case  in  a  permutation  network). 

Concentrators  are  becoming  increasingly  impor¬ 
tant  in  distributed  computing  systems  requiring  the 
establishment  of  disjoint  communication  paths  be¬ 
tween  a  subset  of  a  large  set  of  not  necessarily 
identical  devices  and  a  smaller  set  of  identical 
devices.  For  example,  suppose  that  each  input  was 
connected  to  a  terminal  device  and  each  output  was 
connected  to  a  computing  device.  At  any  one  time, 
up  to  C  users  could  eaclv  be  at  some  subset  of  the 
N  terminal  devices,  each  requesting  a  connection 
to  a  computing  device.  Clearly,  since  all  the  com¬ 
puting  devices  are  identical,  it  would  not  matter 
which  computing  device  was  connected  to  each  termi¬ 
nal  device.  Concentrators  satisfy  this  type  of 
interconnection  requirement. 


The  actual  connections  of  inputs  to  outputs 
in  a  concentrator  are  accomplished  by  means  of  a 
switch  mechanism  referred  to  as  a  crosspoint.  The 
detailed  implementation  of  an  actual  crosspoint  is, 
of  course,  application  dependent.  In  the  following 
a  crosspoint  can  simply  be  considered  to  be  a 
switching  element  when  "closed"  provides  a  connect¬ 
ing  path  between  the  line  entering  it  and  the  line 
leaving  it,  as  opposed  to  when  it  is  "open"  and  no 
such  connecting  path  exists  between  these  two  lines. 

A  concentrator  is  called  a  one-stage  concen¬ 
trator  if  it  is  realized  in  the  form  of  a  so-called 
crossbar  network  where  between  any  input  and  any 
output  there  is  at  most  one  crosspoint.  In  the 
following  it  should  be  understood  that  we  will  be 
concerned  only  with  one-stage  concentrators.  Ob¬ 
viously,  a  complete  crossbar  network  in  which  there 
is  a  crosspoint  between  each  of  the  N  inputs  and 
M  outputs  can  operate  as  a  (N,M,M) -concentrator. 
However,  since  NM  crosspoints  is  usually  a  prohibi¬ 
tively  high  number,  we  will  consider  sparse  cross¬ 
bar  networks  in  which  there  is  at  least  one  input 
and  output  between  which  there  is  no  crosspoint. 
Clearly,  the  number  of  crosspoints  and  their  place¬ 
ment  are  the  crucial  issues  relative  to  the  capa¬ 
bilities  of  sparse  crossbar  networks.  Accordingly, 
our  concern  in  the  following  will  be  with  the  total 
number  of  crosspoints  and  specifications  of  the  out¬ 
puts  to  which  each  input  has  crosspoints  or,  equiva¬ 
lently,  specifications  of  the  inputs  to  which  each 
output  has  crosspoints  on  a  crossbar  grid  such  that 
the  resulting  network  functions  as  a  (N,M,C)- 
concentrator.  Such  specifications  will  be  referred 
to  as  a  design  of  that  concentrator. 

Preliminaries 

In  order  to  consider  sparse  crossbar  networks 
as  concentrator  designs,  we  will  need  the  following 
notation  and  definitions.  Any  one-stage  crossbar 
network  can  be  described  as  a  triplet  (I,0,R), 
where  I  is  the  input  set  ,  |l|  <*N  ,  0  is  the  out¬ 
put  set  ,  |o|  «M  ,  and  R  is  a  relation  between  I 
and  0  where  for  icl  and  ocO,ocR(i)  inplies 
there  exists  a  crosspoint  between  input  i  and  out¬ 
put  o .  This  will  at  times  be  expressed  in  the 
following  by  the  statement  that  input  i  is  incident 
to  output  o ,  or,  equivalently,  that  output  o  is 
incident  to  input  i  .  R  can  graphically  be 
expressed  as  a  crosspoint  diagram  such  as  that  shown 
in  Figure  1.  In  this  figure,  (which  will  later  be 
identified  as  a  (J)-network)  I « {1,2, 3,4, S, 6)  , 

0  -  (a,b,c,d)  ,  and  R(l)  »  (a,b)  ,  R(2)  »  (a,c)  ,  R(3)  « 
U,d)  ,  R(4)  -  {b,c}  ,  R(5)  -  (b,d)  ,  R(6)  -  {c,d)  .  An 
*«"  on  the  grid  between  input  i  and  output  o 
implies  that  o  e  R(i)  . 
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Fig.  1-  A  (j) -network. 
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network  is  a  design  of  a  (6,4,41-concei 

A  subnetwork  of  the  network  describe 
{l,0,R}  is  simply  a  network  described  b: 

(I ' ,0' ,R* )  where  I'  SI.  O'cO, 

R*  £R  . 

The  isolation  set  ,  a  (O')  ,  of  a  subset  O'  c_0 
of  a  network  described  by  (I,0,R)  is  the  subset 
I '  ^ I  consisting  of  inputs  incident  to  outputs 
only  in  O'  .  That  is. 


O' 
o(O') 


( i  |  i  c  I  and  R(i)  c_  O'  )  .  . 


Full  Capacity  Concentrators 


An  (N,M,C)-concentrator  where  C«M  is  called 
a  full  capacity  concentrator.  The  following  is  a 
necessary  condition  on  designs  of  full  capacity 
concentrators. 


Theorem  li  The  minimal  number  of  crosspoints 


required  in  the  design  of  an  (N,M,H)-concentrator 
is  <N-M+1)M. 


Proof ;  Assume  that  some  output,  say,  ocO  of  a 
design  of  an  (N,H,M) -concentrator  was  incident  to 
N-M  inputs.  This  means  there  are  N-  (N-m)  -  M 
inputs  with  no  crosspoints  to  o  .  But,  clearly 
this  contradicts  the  assumption  that  this  design 
functions  as  a  (N,M, Ml -concentrator,  since  these 
M  inputs  only  have  crosspoints,  and,  therefore, 
can  be  connected  to,  at  most  M-l  outputs. 

Hence,  in  a  (N,M,M) -concentrator  design,  each  out¬ 
put  must  be  incident  to  at  least  N  -  M  ♦  1  inputs. 
This  results  in  a  total  of  at  least  (N-M+UM 
crosspoints  in  any  design. 


Q.E.D. 


Designs  of  (N,M,M) -concentrators  having  exactly 
this  minimal  required  number  of  crosspoints  can  be 
given.  Clearly,  in  such  designs,  the  placement  of 
these  (N-M  +  1)M  crosspoints  must  be  such  that  for 
any  choice  of  M  of  the  N  inputs,  all  subsets 
of  size  M'  <  M  of  these  chosen  M  inputs  must 
collectively* be  incident  to  a  total  of  at  least  M’ 
outputs.  Figures  2(a)  and  2(b)  give  two  designs 
of  a  (6, 4, 4) -concentre tor  satisfying  this  condition 
and  having  the  minimal  number  of  12  crosspoints. 
These  two  designs  can  obviously  be  generalized  for 
any  (N,M,M)-concentrators.  It  should  be  noted  that 


these  designs  are  non-isomorphic  in  the  sense  that 
one  cannot  be  transformed  into  the  other  by  row/ 
column  permutations.  _ _ 


Fig.  2.  Two  designs  of  a  (6,4,4) -concentrator. 

In  addition  to  the  two  designs  of  Figure  2, 
for  certain  values  of  N  and  M  ,  other  non¬ 
isomorphic  designs  of  full  capacity  concentrators 
can  be  given.  We  have  already  stated  that  the 
binomial  (^-network  of  Figure  1  was  a  design  of  a 
(6,4,4)  concentrator  [3).  Note  that  this  design 
has  the  minimal  number  of  12  crosspoints.  More 
generally,  for  all  v^2  ,  (v*2 ) -networks  are  de¬ 
signs  of  ( (v^2 )  ,  v+2,v+2)-concentrators  [3).  It  is 
clear  that  these  designs  have  the  minimal  number  of 
crosspoints  since 

( (vJ2)-(v+2)+l) • (v+2)»v» (wv2)  • 

In  some  cases  a  subnetwork  of  a  (vv3) -network 
can  be  determined  which  is  a  design  of  a  minimal 
full  capacity  concentrator.  It  can  be  shown  that 
a  (vJ2)-network  is  a  design  of  a  I (VJ2)  ,  v*3  ,  v+2)- 
concentrator  [3) .  For  certain  choices  of  v ,  by 
deleting  inputs  from  such  a  network,  minimal  de¬ 
signs  for  full  capacity  concentrators  with  v+3 
outputs  can  result.  For  example.  Figure  3(a)  shows 
a  (^)-network  which  is  a  design  of  a  (20,6,5)- 
concentrator.  Now,  by  deleting  the  inputs  1,2, 6, 9, 
10,13,14,16,17,  and  18  (where  the  input  numbering 
is  from  left  to  right) ,  the  network  of  Figure  3 (b) 


(a)  (b) 


Fig.  3  (a).  A  (|)-network  which  is  a  design  for 
a  ( 20, 6, 5) -concentrator ; 

(b).  a  design  for  a  (10,6,6)-concentrator. 

results.  It  can  be  shown  that  this  resulting  net¬ 
work  is  a  design  of  a  (10,6,6) -network.  Moreover, 
it  can  be  shown  that  there  are  at  least  144  ways  to 
delete  10  inputs  of  the  (3) -network  to  produce  non¬ 
isomorphic  minimal  designs  of  a  (10,6,61-concentrator. 

Finally,  the  number  of  non-isomorphic  minimal 
designs  of  a  full  capacity  (N,M,M) -concentrator  for 
any  N  and  M  is  an  open  question.  Clearly,  we 
could  always  find  at  least  two.  We  will  see  in  the 
following  sections  that  for  the  more  general  (N,M,C)- 
concentrators,  M > C,  this  is  not  necessarily  the  case. 

A  Lower  Bound  for  (N,M,C)-Concentrators 

In  the  previous  section,  we  showed  minimal  de¬ 
signs  of  (N,M,C)-concentrators  for  the  special  case 
where  M-C  .  In  this  section  we  will  give  a  new 
lower  bound  on  the  number  of  crosspoints  required 
in  any  (N,M,C)-concentrator,  M  >C  .  It  will  be  seen 
that  in  some  cases  designs  satisfying  this  lower 
bound  can  be  given. 


1 
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Consider  any  design  of  a  (N,M,C)-concentrator. 
Assume  that  this  design  contains  Nx  crosspoints, 
where  x  is  the  average  number  of  crosspoints  for 
input.  That  is,  on  the  average,  each  input  is  in¬ 
cident  to  x  outputs.  In  the  sparse  crossbar 
network  corresponding  to  this  design,  it  should  be 
clear  that,  since  the  network  by  assumption  has 
capacity  C ,  any  choice  of  C  of  the  M  outputs, 
say.  O'  defines  a  subnetwork  relative  to  the  in¬ 
puts  in  the  isolation  set,  a (O')  ,  which  functions 
as  a  full  capacity  concentrator.  Let  {Oj^jOj,..., 
0,Nj}  be  the  (?)  possible  choices  of  output  sets 
of  size  C.  Consider  0(0^),  i-1,..., (?) ,  and  let 
Si  be  the  sum  of  the  number  of  outputs  incident 
to  each  of  the  inputs  in  c (0^)  .  since  the  inputs 
of  0(0^)  are  by  definition  isolated  by  0^  ,  we 
can  write  that 


S-  -  l  |R(j)|  . 
jecMO^ 

It  should  be  clear  that  S^  is  the  total  number 
of  crosspoints  between  the  inputs  of  o (0^)  and 
the  outputs  of  Oj.  As  the  original  network  was  a 
design  for  an  (N,M,C) -concentrator,  it  follows  that 
each  of  these  subnetworks  must  each  be  a  design  for 
a  ( |o  (0^) | ,C,C) -concentrator  i«l,...,(?)  .  So,  by 
Theorem  1,  for  all  i-l,...,(?)  , 

( 1 0  (0±)  |  -C+l)  *C  <_  SA  .  (1) 

Summing  over  all  (?)  choices  of  output  sets  of  size 
C  ,  and  dividing  by  (?)  yields 

(?)  f 

T  (|oto  }|-C+1)C  1  s.  (2) 

i=l  _  1-1 

(?)  "  <?> 

It  should  be  noted  here  that  (2)  is  a  necessary 
condition  for  (1) .  That  is,  if  (2)  is  not  satis¬ 
fied,  then  there  must  be  at  least  one  Oi  such  that 
(1)  does  not  hold  and,  therefore,  the  original  net¬ 
work  could  not  provide  a  capacity  of  C .  Now,  (2) 
can  be  rewritten  as 


.  (5) 

M  M-l  *  M-x+1 

Since  there  are  N  inputs,  it  follows  that  the 
average  number  of  inputs  isolated  by  an  output  set 
of  size  ■  C  is 
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M  M-l 


C-x+1 

M-x+1 


)  N  . 


(6) 
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(7) 


Consider  next  the  right-hand-side  of  (3). 
The  term 

<C> 


(?)  (8) 

is  the  average  number  of  crosspoints  in  the  design 
between  a  set  of  Inputs  isolated  by  C  outputs  and 
those  outputs.  Using  (6)  and,  again,  that  there  are 
on  the  average  x  crosspoints  per  input  in  the 
design,  it  follows  that 


(c> 


i-1 


(?) 
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M-l 


C-x+1 

M-x+1 


)  •  Nx 


(9) 


Using  (7)  and  (9)  in  (2),  rearranging,  and 
combining  terms  allows  us  to  state  the  following: 

Theorem  2:  A  lower  bound  on  the  number  of  cross- 

points  in  any  design  of  a  (N,M,C) -concentrator  is 
Nx  where  x  satisfies 

—  -sESr)  <^>*c-c2-o .  do) 

Observe  by  Theorem  2  that,  as  should  be 
expected  for  the  full  capacity  concentration  case 
where  C-M,  (10)  is  satisfied  by 
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Consider  now  the  following  term  from  the  left- 
hand-side  of  (3)  : 


(?) 

I  l<7  <0  >  | 

i-1 

(?)  ’  (4) 

Clearly,  this  term  is  the  average  number  of  inputs 
isolated  by  an  output  set  of  size  C  in  this 
(N,M,C)-concentrator  design.  An  equivalent  expres¬ 
sion  to  (3)  can  be  obtained  as  follows.  Recall 
that  each  input  is  incident  on  the  average  to  x 
outputs  in  the  design.  Hence,  the  probability  that 
an  input  is  isolated  by  C  of  the  M  outputs  is 


(N-M+l) -M 
X  N 

which  agrees  with  Theorem  1. 

Observe  next  that  for  the  general  binomial 
design  of  a  (“)-netvork  for  a  ( (“) ,u,v+2>- 
concentrator  where  N  »  (“)  ,M»u,  and  C»v+2,  (10) 
is  satisfied  by  x-v  .  Hence,  we  have  the  new 
result  that  binomial  networks  are  minimal  designs. 
This  is  stated  formally  as 

Corollary  1:  (“(-networks  are  minimal  designs  of 

((“)  >  u  «  v+2) -concentrators. 

We  can  also  use  Theorem  2  as  a  benchmark  to 
compare  candidate  designs  of  (N,M,C)-concentrators. 
Designs  with  Nx1  crosspoints  where  x’  is  "close" 
to  the  value  of  x  which  satisfies  (10)  are  at 
times  acceptable  for  those  cases  when  no  design 
with  Nx  crosspoints  is  known.  For  example,  con¬ 
sider  the  composite  binomial  network  of  Figure  4. 
This  design  is  obtained  by  taking  a  (§) -network  as 
shown  in  Figure  1  and  replacing  every  crosspoint 


( 


Fig.  4.  A  (\) 2-network, 


with  another  (J) -network  together  with  the  proper 
input  and  output  expansion.  The  resulting  net¬ 
work,  in  this  case,  is  called  a  (£)  -network)  and 
it  can  be  shown  that  it  is  a  design  of  a  (36,15,12)- 
concentrator  [4J.  The  ( j) ^-network  has  Nx' - 
(36W4J-144  crosspoints.  From  (10),  we  find  that 
X- 3.548;  therefore  Nx  »  91.733. 

Linear  Concentrators  and  Superconcentrators. 

A  (Nk,Mk,Ck) -concentrator  for  given  N,M,  and  c 
is  called  a  linear  concentrator  since  the  number 
of  inputs,  outputs,  and  the  capacity  grow  linearly 
with  k  .  Linear  concentrators  have  been  used  as 
network  components  to  prove  the  existence  of  cer¬ 
tain  superconcentxator  designs  15] ,  (61 .  A  super¬ 
concentrator  is  an  interconnection  network  of  some¬ 
what  more  powerful  connecting  capability  than  that 
of  a  concentrator  in  the  sense  that  in  a  super¬ 
concentrator  there  are  N  inputs  and  N  outputs  and 
for  any  specified  subsets  of  K<.N  inputs  and  out¬ 
puts,  there  exists  a  disjoint  path  to  connect  each 
input  to  some  output  in  the  respective  subsets. 
Hence,  unlike  a  concentrator,  for  a  super concentra¬ 
tor  both  the  input  and  output  subsets  to  be  con¬ 
nected  can  be  specified  and  they  can  be  of  any  site 
less  than  or  equal  to  N .  However,  within  these 
subsets  the  particular  one-to-one  connections  cannot 
be  specified.  Pippenger  [6]  proved  the  existence 
of  superconcentrators  with  at  most  40N  crosspoints 
This  proof  was  based  upon  a  recursive  construction 
using  (6k,4k,3khconcentrators  which  Pippenger  proved 
existed  with  at  most  36k  crosspoints.  In  this 
section,  we  will  establish  a  lower  bound  on  the 
number  of  crosspoints  in  superconcentrator  designs 
using  Pippenger  construction.  This  will  be  accom¬ 
plished  by  establishing  lower  bounds  on  crosspoints 
for  the  linear  concentrators  it  utilizes. 

To  begin,  consider  the  more  general  case  of 
(Nk,MK,Ck)-concentrators.  For  this  case,  an  ap¬ 
proximation  to  (10)  of  Theorem  2  for  sufficiently 
large  k  is 

(— )x.  (Nk)  •  (Ck-x) eck-c2k2  -  0  .  (11) 

MK 

Now,  upon  rearranging  (11)  and  dividing  by  k  , 
we  can  consider  the  limit  of  the  result  as  k  ap¬ 
proaches  infinity.  That  is, 

VU"{!4((!>*-NC-C2)-  -3((|)X-Nx-C)}  -  0  .  (12) 

kZ  M  k 

Clearly,  (12)  converges  to 

(£)x.NC  -C2  -  0  .  (13) 


Talking  logarithms  yields 

_  log  N  -  log  C  _  (14) 

log  M  -  log  C 

Equation  (14)  represents  a  lower  bound  on  the 
average  number  of  crosspoints  in  any  (Nk,Mk,Ck)- 
concentrat or  design  for  large  k . 

Returning  then  to  the  issue  of  Pippenger* s 
superconcentrator  construction  which  was  based  on 
the  recursive  use  of  (6k,4k,3k)-concentrators  with 
6  crosspoints  per  input,  it  is  seen  from  (14)  that 
a  lower  bound  on  the  number  of  crosspoints  per  in¬ 
put  for  such  linear  concentrators  is  x«  2.40942. 
When  (6k, 4k, 3k) -concentrators  with  (2.40942) * (6k) 
crosspoints  are  used  in  Pippenger 's  superconcen¬ 
trator  construction,  it  can  be  seen  that  a  lower 
bound  on  the  number  of  crosspoints  in  the  result¬ 
ing  superconcentrator  is  17.4565N. 

It  is  of  interest  to  consider  alternative  lin¬ 
ear  concentrators  because  Pippenger 's  superconcen¬ 
trator  construction  only  requires  the  use  of  con¬ 
centrators  where  the  capacity  is  equal  to  one-half 
the  number  of  inputs,  but  the  ratio  between  the 
number  of  inputs  and  outputs  has  some  flexibility, 
let  9  correspond  to  this  ratio  in  the  sense  that 
we  will  consider  (n,0n.  In  ) -concentrators.  Now, 
from  (14), 

log  n-  log  j  j  2  (15) 

x  -  — . —  -  -  — - , 

log  Bn  -  log  -j-  log  2  +  log  B 

then  the  number  of  crosspoints  in  Pippinger's  con¬ 
struction  can  be  expressed  as 


n.(2. 


lo?  2 - -  +l)*~r 


log  2  +  log  9 


i. 

1-S 


(16) 


where  i  <  B  <  1  . 

Numerical  analysis  shows  that  (16)  is  mini¬ 
mized  when  B«  0.70251.  From  (15)  this  results  in 
x-2. 03835.  Then,  if  (n  ,0. 70251  n,ii ) -concentrators 
with  2.03B35n  crosspoints  were  used  with  Pippenger’s 
construction,  the  result  would  be  a  lower  bound  of 
17.0652N  crosspoints.  No  design  of  a  superconcen¬ 
trator  using  Pippenger *s  construction  could  have 
less  crosspoints. 
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Abstract 

Lower  bounds  on  the  required  number  of 
crosspoints  in  realizations  of  interconnection 
networks  called  concentrators  and  superconcen¬ 
trators  are  given.  This  work  is  in  contrast  to 
many  of  the  other  results  in  the  literature  which 
establish  upper  bounds  by  proving  that  with  a 
certain  number  of  crosspoints,  various  concen¬ 
trator  designs  do  exist.  The  lower  bounds  are 
obtained  by  using  a  straightforward  necessary 
condition  on  the  number  of  crosspoints  for  the 
special  case  of  sparse  crossbar  full  capacity 
concentrators.  It  is  seen  that  this  condition 
iffust  be  satisfied  by  all  embedded  full  capacity 
concentrators  contained  in  more  general  concen¬ 
trator  designs,  and  on  the  basis  of  this  obser¬ 
vation,  a  general  necessary  condition  on  the 
required  number  of  crosspoints  in  a  concentrator 
is  established.  This  general  crosspoint  condi¬ 
tion  is  exploited  to  obtain  our  lower  bound  re¬ 
sults  by  demonstrating  that  an  equal  distribution 
of  a  given  number  of  crosspoints  between  inputs 
and  outputs  in  a  sparse  crossbar  concentrator  is 
the  most  efficient  use  of  those  crosspoints  rela¬ 
tive  to  maximizing  the  resulting  capacity.  As 
the  use  of  sparse  crossbar  concentrators  as  com¬ 
ponents  in  the  designs  of  other  types  of  inter¬ 
connection  networks  is  common,  our  lower  bound 
results  can  also  be  used  to  obtain  lower  bounds 
on  crosspoints  in  those  interconnection  network 
designs  as  well.  In  particular  Pinsker's  multi¬ 
stage  concentrators  and  Pippenger’s  superconcen¬ 
trators  are  considered  in  this  paper. 

Introduction 

An  (n,m,c) -concentrator  is  an  interconnec¬ 
tion  network  with  n  inputs  and  m  outputs 
(m<n)  where  for  an  specified  subset  of  c<m 
inputs,  all  of  the  specified  inputs  can  simul¬ 
taneously  be  connected  by  means  of  disjoint 
paths  to  some  set  of  c  outputs,  c  is  referred 
to  as  the  capacity  of  the  concentrator.  The 
crucial  limitation  of  the  connecting  capability 
of  this  type  of  interconnection  network  is  that 
given  a  specified  set  of  inputs,  in  order  to 
simultaneously  connect  each  input  in  this  set  to 
an  output,  the  outputs  cannot  in  general  be 
arbitrarily  specified  (as  would  be  the  case  in  a 
permutation  network) . 

The  actual  connections  of  inputs  to  outputs 
in  a  concentrator  are  achieved  by  means  of 
switching  elements  referred  to  as  crosspoints. 
The  detailed  implementation  of  an  actual  cross- 
point  is,  of  course,  application  dependent.  In 
the  following,  a  crosspoint  can  simply  be  viewed 
as  a  switching  element  which  when  "closed”  pro¬ 
vides  a  connecting  path  between  the  line  enter¬ 
ing  it  and  the  line  leaving  it,  as  opposed  to 
when  it  is  "open"  and  no  such  connecting  path 


exists  between  these  two  lines. 

A  specification  of  the  crosspoint  place¬ 
ment  between  the  inputs  and  outputs  will  be 
referred  tp  as  a  design  or  explicit  construction 
of  a  concentrator.  Clearly,  the  simplest  design 
of  a  concentrator  is  in  the  form  of  a  so-called 
(sparse  or  complete)  crossbar  network,  where  be¬ 
tween  any  input  and  any  output  there  is  at  most 
one  crosspoint.  Concentrators  can  also  be 
realized  by  designs  in  the  form  of  composite  net¬ 
works.  These  are  made  up  of  various  serial  and 
parallel  combinations  of  sparse  or  complete 
crossbar  concentrators.  Initially  in  this  paper 
we  will  be  concerned  with  only  the  sparse  cross¬ 
bar  concentrator  realizations,  but,  in  the  con¬ 
cluding  sections,  we  will  consider  composite  net¬ 
works  as  well. 

Obviously,  a  complete  crossbar  network  in 
which  there  is  a  crosspoint  between  each  of  the 
n  inputs  and  m  outputs  will  function  as  an 
(n,m,m) -concentrator.  This  special  case  where 
c=m  will  be  referred  to  as  a  full  capacity  con¬ 
centrator  and  will  henceforth  be  denoted  more 
simply  as  an  (n,m) -concentrator.  Similarly,  a 
sparse  crossbar  network  in  which  there  are  cross- 
points  from  each  input  to  any  c<m  outputs  will 
function  as  an  (n,m,c) -concentrator.  However,  as 
might  be  anticipated  on  the  basis  of  the  rela¬ 
tively  small  number  of  input  to  output  mappings 
that  must  be  realized  by  a  concentrator,  the  nm 
and  nc  resulting  crosspoints,  respectively,  for 
each  of  these  cases  are  usually  unnecessarily 
high.  Indeed,  it  is  well  established  that 
(n,m,c) -concentrators  and  (n,m) -concentrators  can 
be  constructed  with  0(n)  crosspoints.  For 
example,  Pinsker  (1J  has  shown  that  (n,m)- 
concentrators  can  be  constructed  with  at  most 
29n  crosspoints.  This  was  accomplished  by  means 
of  a  probabilistic  argument  that  demonstrated  the 
existence  (but  not  the  explicit  crosspoint  place¬ 
ment)  of  some  sparse  crossbar  concentrators  that 
are  components  in  a  composite  concentrator  net¬ 
work.  Similarly,  Pippenger  [2]  used  a  similar 
argument  to  show  the  existence  of  sparse  crossbar 
designs  of  (n  ,  £n  ,  ^.n  ) -concentrators  with  at 
most  6n  crosspoints.  These  were  then  used  as 
components  in  a  composite  design  of  an  intercon¬ 
nection  network  called  a  superconcentrator.  These 
types  of  results  provide  upper  bounds  on  the  total 
number  of  crosspoihts  required  in  the  design  of 
the  associated  concentrator. 

There  also  have  been  contributions  regarding 
explicit  constructions  of  concentrators.  Most 
notably.  Gabber  and  Galil  13]  have  recently  given 
a  construction  of  a  sparse  crossbar  concentrator 
which  was  first  studied  by  Margulis  14)  and  which 
furthermore  permits  an  explicit  construction  of  a 


superconcentrator  in  the  form  of  the  composite 
network  suggested  by  Pippenger.  Masson  15]  also 
gave  a  construction  of  a  sparse  crossbar  concen¬ 
trator  with  what  is  referred  to  as  a  binomial 
crosspoint  placement. 

In  this  paper,  we  will  be  concerned  with 
both  the  minimum  number  of  crosspoints  required 
in  (n,m,c)  and  (n,m) -concentrators  and  some  ex¬ 
plicit  constructions.  Regarding  the  former,  we 
will  establish  lower  bounds  for  sparse  crossbar 
concentrator  designs.  With  these  results,  we 
will  then  be  able  to  examine  some  explicit  sparse 
crossbar  constructions,  seeing  in  some  cases 
(for  example,  Masson's  binomial  concentrator) 
that  these  designs  are,  indeed,  minimum.  Finally, 
we  will  be  able  to  determine  lower  bounds  on  the 
composite  network  realizations  of  concentrators 
given  by  Pinsker  and  superconcentrators  given 
by  Pippenger. 

Preliminaries 

To  consider  sparse  crossbar  networks  as 
concentrator  designs,  we  will  use  the  following 
notation  and  definitions.  Any  sparse  crossbar 
network,  denoted  in  general  as  N ,  can  be 
described  as  a  triplet  (I,0,R),  where  I  is  the 
input  set  ,  1 1 1  =  n  j  0  is  the  output  set ,  1 0 1  -  m  : 
and  R  is  a  relation  between  I  and  0  where  for 
i  e  I  and  o  e  0  ,  o  e  R(i)  implies  there  exists 
a  crosspoint  between  input  i  and  output  o  . 

This  will  at  times  be  expressed  in  the  following 
by  the  statement  that  input  i  is  incident  to 
output  o  ,  or,  equivalently,  that  output  o  is 
incident  to  input  i  . 

A  subnetwork,  N'  ,  of  the  sparse  crossbar 
network,  N  ,  described  by  (I,0,R)  is  simply  a 
network  described  by  (I'.O'.R*)  where  I' cl, 
O'  £0  ,  and  R'  £R  . 

The  isolation  set,  o (O' )  ,  of  a  subset 
O' £0  of  a  sparse  network  described  by  (I,o,R) 
is  the  subset  I'£l  consisting  of  inputs  inci¬ 
dent  to  outputs  only  in  O'  .  That  is, 

o(O')  =  {i | i  e  I  and  R(i)  c_  O' }  . 

The  total  number  of  crosspoints  between  the 
inputs  of  a  (O')  and  the  outputs  of  O'  will 
be  denoted  as  t'  .  It  is  clear  that 

f  "  l  |R(j)|  • 

jeo (O') 

We  will  at  times  in  the  following  consider 
concentrator  designs  where  the  explicit  cross- 
point  placement  is  not  specified,  but,  instead, 
only  the  crosspoint  distribution  of  the  design 
is  given.  W(j)  will  denote  the  number  of  inputs 
in  a  sparse  crossbar  network  that  are  incident 
to  exactly  j  outputs.  In  the  discrete  case, 

W(j)  =  |  { i  |  i  E  I  and  |R(i)|«j}| 


£  W(j  )  =  n  . 

5-1 

This  latter  crosspoint  distribution  and  any  ether 
such  distribution  where  each  input  is  incident 


to  exactly  the  same  number  of  outputs  will  be 
referred  to  in  the  following  as  a  1-point  or 
singular  crosspoint  distribution.  More  generally 
an  f-point  crosspoint  distribution  is  one  in 
which  there  is  a  set  of  t  distinct  incidence 
values,  J  “  { jj, j2» ■ • • .3^)  #  such  that 


W(j)  f  0 


*  jk  for  all  jA 


5  t  J  , 


in  J  ,  i  f  k  i 


and  where 


jL wtj> 


Full  Capacity  Sparse  Crossbar  Concentrators 

The  following  is  a  necessary  condition  on 
the  number  of  crosspoints  in  designs  of  full 
capacity  concentrators. 

Theorem  1:  The  minimum  number  of  crosspoints 

required  in  a  sparse  crossbar  design  of  an  <n,m)- 
concentrator  is  (n-m+l)m  • 

Proof:  Assume  that  some  output,  say  o  £  0  of 

a  sparse  crossbar  design  of  an  (n,m) -concentrator 
was  incident  to  n-m  inputs.  This  means  there 
are  n-(n-m)  =m  inputs  with  no  crosspoints  to 
o  •  But,  clearly  this  contradicts  the  assumption 
that  this  design  functions  as  a  (n,m) -concentra¬ 
tor,  since  these  m  inputs  only  have  crosspoints 
to,  and,  therefore,  can  be  connected  to,  at  most 
m  -  1  outputs.  Hence,  in  a  sparse  crossbar  (n,m)- 
concentrator  design,  each  output  must  be  incident 
to  at  least  n-m+1  inputs.  This  results  in  a 
total  of  at  least  (n-m+l)m  crosspoints  in 

any  design.  _  „  _ 

Q.E.D. 

A  Ixswer  Bound  for  Sparse  Crossbar  (n,m,c)- 
Concentrators 

In  the  previous  section,  we  showed  minimum 
sparse  crossbar  designs  of  (n,m,c) -concentrators 
for  the  special  case  where  m -  c .  In  this  sec¬ 
tion  we  will  give  a  lower  bound  on  the  number  of 
crosspoints  required  in  any  sparse  crossbar 
(n,tn,c) -concentrator ,  c<m  •  It  will  be  seen  in 
the  following  section  that  in  some  cases  designs 
satisfying  this  lower  bound  can  be  given. 

Consider  any  sparse  crossbar  design  of  an 
(n,m,c) -concentrator.  Assume  that  this  design 
contains  nxQ  crosspoints,  where  xQ  is  the 
incidence  mean  of  the  design.  That  is,  ignoring 
for  the  moment  that  in  general  x0  is  not  an 
integer,  each  input,  on  the  average,  is  incident 
to  xQ  outputs.  In  this  sparse  crossbar  net¬ 
work,  since  by  assumption  the  capacity  is  c  , 
any  choice  of  c  of  the  m  outputs,  say,  0 *  , 
defines  a  subnetwork  relative  to  the  inputs  in 
the  isolation  set,  o(o')  ,  which  must  function 
as  a  full  capacity  concentrator.  Let  {0^,03 » 

be  the  (J)  possible  choices  of  out¬ 
put  sets  of  si2e  c  .  Consider  o(0.),  i«=l,..., 
(J)  .  It  follows  that 

( jo  (0^)  j  -c+1)  *c  <_  .  (1) 


Summing  over  all  (£)  choices  of  output  sevs  of 


I  loWi*  I 

,i=l 


-  c+l)c  *  ^ — 

£> 


Recall  that  W(j)  is  the  number  of  inputs 
in  a  sparse  crossbar  network  that  are  incident 
to  exactly  j  outputs.  Now  suppose  that  a  given 
input  is  incident  to  j  outputs.  It  follows 
that  of  the  (£)  possible  selections  of  c  of 
the  m  outputs,  exactly  of  those  choices 

isolate  this  input.  It  is  furthermore  easily 
seen  that 

(?) 

/Ht-j.  —  _  »nt\  .  c  1  (m~3 )  1 

(c-j>  “  <=’  =  c  m!  (c-j)T  • 

Por  convenience  in  the  following,  we  will 


to  be  placed  between  the  n  inputs  and  m  out¬ 
puts  of  a  crossbar  grid  so  as  to  provide  the 
maximum  possible  capacity,  there  is  a  natural 
tendency,  because  of  analogies  which  can  be  made 
with  other  physical  phenomena,  to  in  general  con¬ 
sider  crosspoint  distributions  that  evenly  dis¬ 
tribute  the  crosspoints  among  the  inputs.  In 
other  words,  intuitively  one  would  expect  to  be 
able  to  maximize  the  capacity  with  a  singular 
crosspoint  distribution,  indeed,  if  we  ignore 
the  fact  that  x  in  general  is  not  an  integer, 
this  turns  out  to  be  the  proper  approach,  and 
this  is  demonstrated  rigorously  in  (7J.  Hence 
we  can  conclude  that  to  obtain  our  lower  bound 
on  the  total  number  of  crosspoints  required  in 
sparse  crossbar  designs  of  (n,m,c) -concentrators 
from  our  necessary  condition  as  given  by  (3),  it 
is  only  necessary  to  consider  the  singular  cross- 
point  distribution  case.  This  will  be  formally 
stated  as 

Theorem  2 ;  A  lower  bound  on  the  number  of 
crosspoints  in  a  sparse  crossbar  design  of  an 
(n,m,c) -concentrator  is  r.x  where  x  satisfies 

(x>  2 

-n*  (c-x)  -  cz  «■  c  *  0  .  M) 


<?:3>  * 


With  the  above  and  the  crosspoint  distribution, 
it  follows  that 


l  <c>9(j>W<j>  . 
j-1 


(c> 

I  l°«vl 

i-1 


l  g(j)-wij) 
j-i 


Siralarly,  it  can  be  seen  that 


Binomial  Networks 

An  obvious  question  raised  by  the  lower 
bound  equation  of  Theorem  2  is  whether  or  not 
sparse  crossbar  designs  actually  satisfying  the 
eguation  exist.  It  can  be  seen  that  when  the 
values  of  the  parameters  n,  m,  and  c  satisfy 
certain  binomial  interrelationships,  this  is  the 
case.  More  particularly,  an  (^J)-network  has  (™) 
inputs,  m  outputs,  and  a  singular  crosspoint 
distribution  wherein  each  input  is  incident  to 
v  outputs.  Moreover  in  such  a  network,  for  any 
input,  the  placement  of  the  v  crosspoints  be¬ 
tween  it  and  the  outputs  is  distinct  from  that  of 
any  other  input.  The  capacity  of  such  a  design 
is  known  to  be  v+2  [5).  Substituting  n*  (J) » 

x  -  v  ,  and  c  »  v  +  2  into  the  left-hand  side  of 
(4)  yields 


^ -  “  I  9(j)-W(j)*j  . 

<c> 

Using  the  above  in  (2)  and  rearranging  terms 
yields  the  crosspoint  distribution  form  of  our 
necessary  condition  on  sparse  crossbar  designs 
of  (n,m,c) -concentrators: 

m  , 

l  <g(j)  *w(j)  •  (c-j)  )-c  +  c  <_  0  .  (3) 

j«l 

To  obtain  our  lower  bound,  we  now  must  con¬ 
sider  all  possible  crosspoint  distributions  that 
satisfy  (3)  for  sparse  crossbar  designs  of 
<n,m,c)-concentrators.  For  all  practical  pur¬ 
poses  this  task  would  be  impossible  were  it  not 
for  an  intuitively  satisfying  property  of  cross- 
point  distributions  that  reflects  itself  into 
the  summation  terms  of  (3).  Given  some  fixed 
number  of  crosspoints,  say,  nx  ,  x<c  ,  which  are 


(v+2)  (v+1) 
2 


I  • (v+2-v)  -  (v+2)2  +  <v+2) 


•  (2)  -  (v+2)  +  (v+2) 


In  other  words,  x  =  v  satisfies  (4)  relative  to 
( (") ,m,v+2)-concentrators,  and  binomial  networks 
are,  therefore,  minimum  sparse  crossbar  designs 
of  such  concentrators. 

Fixed  Ratio  Concentrators 

For  given  integers  n,  m,  and  c,  an  (nk,mk,ck)~ 
concentrator  will  be  referred  to  as  a  fixed  ratio 
concentrator  since,  although  the  number  of  inputs, 
outputs,  and  the  capacity  vary  according  to  k , 
their  ratios  remain  fixed.  For  fixed  ratio  con¬ 
centrators,  we  can  again  use  (4)  to  determine  a 
lower  bound  on  the  average  number  of  crosspoints 
in  their  sparse  crossbar  designs.  For  this  case 


with  sufficiently  large  k,  it  is  easy  to  see 
that  our  lower  bound  equation  (4)  can  be  approxi- 
mated  as  .  *  2  , 

(gjf)  *nk  (ck  -  x)  -  c  k  +  ck  =  0  , 

°z  2  _  x  2  x 

k  ( (5)  *nc  -  c  )  -  k  ( (— )  *nx  -  c)  «  0  . 

2 

Dividing  through  by  k  and  taking  the  limit  as 
k  approaches  infinity  yields 

lim.k^  c  x  2  k  ,,c,x  ..  . 

k^(^((m>  *nc-c  )-—((-)  *nx  -  c) )  =  0  . 

k  k 


c,x  2 

(— )  •  nc  -  c 


Taking  logarithms,  we  have 

x  «=  (log  n  -  log  c)  /  (log  m  -  log  c)  .  (5) 

Equation  (5)  represents  a  lower  bound  on  the 
average  number  of  crosspoints  in  any  sparse 
crossbar  design  of  an  (nk,mk,ck)-concentrator 
for  large  k  . 

P insker ' s  Concentrators 

In  this  section  we  will  give  lower  bounds 
on  some  of  the  prominent  composite  networks  for 
concentrators.  These  composite  networks  use 
sparse  crossbar  networks  as  their  building  blocks 
or  components,  and,  hence,  their  lower  bounds 
will  be  based  primarily  upon  the  results  of  the 
previous  sections  on  sparse  crossbar  concentra¬ 
tors.  To  expidite  the  presentation  of  these 
resu’ts  (as  well  as  the  results  of  the  follow¬ 
ing  section) ,  we  will  first  develop  some  nota¬ 
tion  to  describe  composite  networks. 

Given  two  (sparse  crossbar  or  composite) 
networks  n2  and  N2  where  N2  has  an  input  set 
Ij  and,  an  output  set  02  ,  and  where  N2  has  an 
input  set  t2  and  an  output  set  02  ,  |OjJ  =  1 12 1  , 
then  Nj«»N2  will  denote  the  serial  product  of 
N2  and  N2  and  will  represent  a  composite  net¬ 
work  hiving  I2  for  an  input  set  and  02  for  an 
output  set,  and  where  the  output  of  N2  and  the 
inputs  of  N2  are  identified  or  associated  on  a 
one-to-one  basis. 

Given  two  (sparse  crossbar  or  composite) 
networks  N2  and  N2  where  [  IjJ  ”  n2  and  |o2|  “ 
m2  and  where  1 I2f“n2  and  |o2)  *  m2  ,  then 
N2|a,b!N2  ,  where  a<min(n2,n2)  ,  b<min(m2,m2) 
will  denote  the  parallel  product  of"  N2  and  N2 
and  will  represent  a  composite  network  having 
ni+n2-a  inputs  formed  from  I2  u  I2  but  where 
the  "lower"  a  inputs  of  N2  and  the  "upper"  a 
inputs  of  Nj  are  superimposed  so  as  to  have 
paths  to  the  outputs  available  either  through  N2 
or  Nj  ,  and  having  m^+m2-b  outputs  formed  from 
02  u  02  but  where  the  "lower"  b  outputs  on  N2 
and  the  "upper"  b  outputs  of  N2  are  superim¬ 
posed. 

Given  a  network  N ,  then  N  will  denote  the 
inverse  of  that  network  where  the  inputs  of  N 
are  used  as  outputs  and  vice-versa. 

Finally,  we  will  let  In  denote  the  n  input, 
n  output  sparse  crossbar  network  with  cross¬ 


points  only  on  the  main  diagonal. 


Figure  It  A  composite  (n, a) -concentrator. 

Now,  consider  the  composite  network  shown 
in  Figure  1.  This  composite  network  was  first 
presented  by  Pinsker  [1).  It  can  be  shown  [1] 
that  if 


N^  is  an  (2(n-m)  ,  n-m) 


concentrator. 


N2  is  an  (m,n-m)  -  concentrator, 
and  2m  ^  n  , 

then  this  composite  network  is  an  (n,m)  concen¬ 
trator.  Clearly,  this  network  can  be  described 
as 

I  |  (n-m)  ,  m|  (N  =>  N  )  . 
m  l  * 

Pinsker  used  a  composite  network  of  this  type  to 
show  that  in  general  (n,m) -concentrators  existed 
with  at  most  29n  crosspoints.  This  was  accom¬ 
plished  by  considering  in  particular  a  composite 
network  design  of  an  (n,® n ) -concentrator  as 
described  by  6 

rs  n  i?n  •  !n  i  (ni  *  v 
6 

where  (N2  ->  N2)  is  a  composite  design  in  which: 

(i)  N2  is  a  (in  ,  in  ) -concentrator  in  the  form 
of  the  combos itl  network 


in  which  N22  is  a  (yn,^-n,^n  ) -fixed 
ratio  concentrator  and  N22  is  (in  ,  in)~ 
concentrator  (which  it  should  be  noted  bhas 
an  input  set,  output  set,  and  capacity  equal 
to  i  that  of  the  overall  concentrator) ; 

5  5  1 

(ii)  N2  is  a  (g-n  ,  jn  ) -concentrator  in  the  form 
of  the  composite  network 

<N23  ~  N22  -  N21> 

in  which  N23  is  a  (|.n  ,in  )-fixed  ratio 
concentrator,  N22  is6a  6  (in,  In  ,  in)- 
fixed  ratio  concentrator,  and  N22  is  a 


(in,  in)- concentrator  (which,  again,  it  should 
be  noted  has  an  input  set,  output  set,  and  capa¬ 
city  equal  to  that  of  the  overall  concentrator). 

Pinsker's  design  is  clearly  recursive. 

Hence,  using  (5)  for  the  component  fixed  ratio 
concentrators ,  we  have  that  a  lower  bound  on  the 
number  of  crosspoints  in  Nj^  and  N22  is 

,  1  .1 

log  -  -  log  - 

in  ( - £ - £)  =  1.26726  n 

log  i  -  log  | 

and  that  a  lower  bound  on  the  number  of  cross- 
points  in  N23  is 

log  -  -  log  - 

|n  ( - 2 - S.)  =  1.93494  n. 

o  x  _  i 

log  -  -  log  - 

Thus,  a  lower  bound  on  the  number  of  crosspoints 
in  Pinsker's  composite  design  of  a  (n  ,S.n>- 
concentrator  is 

(|n)  +  1.93494  n  +  2  (1.26726  n) 

O 

+  (lower  bound  for  a  Pinsker 

(in,  in)-concentrator) .  (6) 

5  ^ 

Recursive  evaluation  (6)  yields  a  lower  bound  on 
the  number  of  crosspoints  in  a  Pinsker  (n  ,  £n)- 
concentrator  to  be  7.4491 n .  Pinsker,  it  snould 
be  noted,  showed  with  his  design  the  existence 
of  (n  ,  j-n)- concentrators  with  at  most  22  n 
crosspoints. 

Now,  with  the  design  for  n(n  ,  ^-n) -concentra¬ 
tors,  Pinsker  considered  (n  ,j)  -concentrators  for 
1  <a  <^n  .  This  was  done  by  considering  ranges  of 
a  .  We  will  likewise  consider  our  lower  bound 
over  these  ranges.  Figure  2  shows  a  graph  of 
k  versus  a  where  kn  is  our  lower  bound  on 
Pinsker  (n  ,  — ) -concentrators. 


Lower  bound  on  Piniker't  in.;) -concentrator. 


Range  ( 1 )  1  < a  <^ :  In  this  range,  the  above 

(n  ,  5  n) -concentrator  is  used  by  simply  designing 
6*  n 

this  network  to  have  the  correct  number  of  — 


outputs  and  then  scaling  down  the  number  of 

inputs  appropriately.  Hence,  in  this  range,  the 

lower  bound  on  the  number  of  crosspoints  in 

Pinsker's  (n  ,  )  -concentrator  is  (7.4491)  A  •  ■-) 

&  a  5  a 

Range  (ii)  <_  a  <_  3  :  In  this  range,  Pinsker 

uses  the  serial  product  of  a  fixed  ratio 

(”  ’  ?  a  ’  l)  -concentrator  and  a  Ir  j  i  jl" 

concentrator.  Hence,  in  this  range  our  lower 

bound  on  the  number  of  crosspoints  in  Pinsker's 

(n  ,  j>- concentrator  is 

( - i£2_2 - ,  n+  (7.4491)  (|  £)  . 

1°9  5^-  +  log  a  ' 

Range  (iii)  3  <_  a  <_  n  :  This  range  is  treated 

by  Pinsker  as  a  set  of  subranges*  In  each  sub¬ 
range,  his  design  consists  of  an  initial  serial 
product  of  fixed  ratio  concentrators  to  reduce 
the  required  concentration  to  that  which  can  be 
addressed  by  a  concentrator  in  range  (ii) •  A 
serial  product  of  that  initial  composite  network 
is  then  taken  with  the  appropriate  concentrator 
from  range  (ii)  to  complete  the  design.  For 


example,  for  the  subrange 


<  a  <  w 

O  vs  .  Z 


product  of  a  fixed  ratio  (n,2.  n  ,§.) -concentrator 
and  an  (^  n  ,  jl.) -concentrator ^with  a  design  from 
range  (ii)  is  used.  Hence,  our  lower  bound  on 
crosspoints  on  Pinsker's  (n  ,  ^-^concentrator  in 
this  subrange  is 


log  — +  log  a  log 


log  a  .  2  ....  6  2  n, 

'-r~"-,*3-n+<7-449i 

—  +log  a 


9  27  • 

Likewise,  in  the  subrange  y  ±  a  <  —  , ^the 
serial  product  of  the  fixed  ratio  4(n  )jn  ,£-)- 
concentrator  and  a  fixed  ratio  (^n,^n  ,  J-) - 
concentrator  is  used  to  bring  the  concentrator 
into  range  (ii).  The  serial  product  of  this 
initial  network  is  then  taken  with  a  (in,  £.) - 
concentrator  resulting  in  a  lower  bound9of  a 
2 

,  log  ,  +  log  a 

,  log  a  .  ,3  ,2 

( - _2 -  )n+  ( - - -  j  n  + 

log  y  +  log  a  log  —  +  log  a 


log  -+  log  a 
(  £ 

109  5 a  +  loq  * 


•)  |n  + (7.4491)  (|*|-n) 


27  81 

Similarly,  the  subrange  —  £a£—  uses  the  serial 
product  of  3  fixed  ratio  concentrators ,  namely, 
an  (n  ,  ^-n  ,  £.) -concentrator,  an  £n  ,  in  ,  —)- 
concentrator,  and  an  (in  ,  yyn  , -concentrator , 
to  bring  the  concentrator  into  range  (ii). 

Hence,  our  lower  bound  curve  of  Pinsker's 
(n  ,  i) -concentrator  can  be  obtained. 

Pippenger's  Superconcentrator 


An  (n,n) -superconcentrator  is  an  intercon¬ 
nection  network  of  somewhat  more  powerful  con¬ 
necting  capability  than  that  of  a  concentrator  in 
the  sense  that  there  are  n  inputs  and  n  outputs 
and  for  any  specified  subsets  of  k  <  n  inputs 
and  outputs,  there  exists  a  disjoint  path  to  con¬ 
nect  each  input  to  some  output  in  the  respective 
subsets.  Hence,  unlike  a  concentrator,  for  a 
superconcentrator  both  the  input  and  output  sub- 


lets  to  be  connected  can  be  specified  and  they 
can  be  of  any  size  less  than  or  equal  to  n . 
However,  within  these  subsets  the  particular  one- 
to-one  connections  cannot  be  specified. 

(n.n)-superconcentrators  were  defined  by 
Valiant  [8]  and  shown  to  exist  with  at  most  238 n 
crosspoints.  Pippenger  later  improved  this  bound 
to  40  n  (2].  Recently,  Gabber  and  Gabll  13]  gave 
an  explicit  construction  of  an  (n,n) -superconcen¬ 
trator  requiring  273  n  crosspoints.  We  will  con¬ 
clude  this  paper  by  considering  a  lower  bound  on 
Pippenger* s  superconcentrator  design. 


rlsur*  Si  ripp«i»9«r*s  <tt,al-»up«rctanc»ntr«tor. 

Consider  the  composite  network  shown  in 
Figure  3.  This  network  can  be  described  as 

ljn,n|  (N1  ->  u2  ->  NJ  . 

Pippenger  showed  that  if 

is  an  (n  ,  jn  ,  jn  ) -concentrator 

and 

2  2 

is  an  Ijn  ,  -jn  )-superconcentrator, 

then  this  composite  network  is  an  (n, n) -super¬ 
concentrator.  Hence,  a  lower  bound  on  PippengeA 
(n,n) -superconcentrator  at  this  point  is 

N  4  2  (t — — — )n4  (lower  bound  on  Pippenger's 

log  4  -  log  7 

(^n  ,^n  ) -superconcentrator) .  Recursive  evalua¬ 
tion  yields  a  lower  bound  on  crosspoints  in  a 
Pippenger  (n,n) -superconcentrator  as  17.4565  n. 

The  key  requirement  in  Pippenger's  design 
is  that  Nj  has  a  capacity  of  in  ,  but  the  only 
requirement  on  the  size  of  the  output  set,  say, 

6  n  is  that  1  <  6  <  -j-  .  Our  lower  bound  on  this 
fixed  ratio  concentrator  is 
loo  2 

'log  2  ♦  log  6'  "  • 

This  gives  the  number  of  crosspoints  in  Pippen¬ 
ger's  (n.n)-superconcentrator  as 


(1  ♦  2( 


1o9 


log  2  e  log  6 


J) 


each 


which  has  a  minimum  of  17,0652 n  at  £  -  0.70251  • 
In  this  version  of  Pippenger's  design, 

(n, 0.70251  n  ,  |n  ) -concentrators  are  used, 
with  an  average  of  2.03635  crosspoints  per  lrput* 
Pippenger's  (n,n) -superconcentrator  cannot  be 
constructed  with  fewer  crosspoints. 


Conclusion 


In  this  paper  we  have  used  a  straightforward 
requirement  on  the  number  of  crosspoints  In 
sparse  crossbar  (n,m) -concentrators  to  obtain 
lower  bounds  for  the  more  general  cases  of  <n,s,ch 
concentrators.  These  lower  bounds  were  attained 
by  means  of  an  argument  which  showed  that  a  non- 
biased  or  singular  distribution  of  a  given  num¬ 
ber  of  crosspoints  between  inputs  and  outputs  in 
a  sparse  crossbar  network  is  the  most  efficient 
use  of  those  crosspoints  regarding  the  resulting 
capacity.  In  certain  cases  It  was  seen  that  the 
lower  bound  could  actually  be  satisfied:  namely, 
when  the  number  of  inputs,  outputs,  and  the 
capacity  satisfy  the  so-called  binomial  interre¬ 
lationships,  as  in  such  situations  the  singular 
distribution  satisfying  our  lower  bound  require¬ 
ments  is  achievable  with  an  integer  number  of 
crosspoints  per  input.  The  use  of  sparse  cross¬ 
bar  concentrators  as  components  in  the  designs  of 
other  types  of  interconnection  networks  permits 
us  to  use  our  lower  bound  results  to  obtain  lever 
bounds  on  crosspoints  in  those  designs  as  well. 

In  particular  we  considered  Pinsker*s  multi-stage 
concentrators  and  Pippenger's  superconcentrators. 


References 


1.  H.  Pinsker,  "On  the  Complexity  of  a  Concen¬ 
trator  ,  *  Proc.  75th  International  Teletraffic 
Conference,  Stockholm,  1973,  pp. 318/1-318/4. 


2.  N.  Pippenger,  “Super concentrators,"  SIAM 
Jour.  Comp.,  Vol.  6,  No.  2,  1977,  pp. 298-304. 

3.  0.  Gabber  and  Z.  Galil,  “Explicit  Construc¬ 
tions  of  Linear  Size  Super concentre tors,*  to 
appear  In  the  Journal  of  Computer  and  System 


Sciences. 


4.  G.  A.  Margulis,  “Explicit  Constructions  of 
Concentrators,"  Problems  of  Information  Trans¬ 
mission,  Plenum,  New  York,  1975. 


5.  G.  Masson,  "Binomial  Switching  Networks  for 
Concentration  and  Distribution,"  IEEE  Trans. 
Conan,,  Vol.  COM-25,  No.  9,  1977,  pp. 873-883. 


6.  P.  Hall,  "On  Representatives  of  Subsets," 
J.  London  Math.  Soc.,  Vol.  10,  1935,  pp. 26-30. 


7.  S.  Nakamura,  "I/>ver  Bounds  and  Product  Designs 
for  Concentrators, ”  Ph.D.  Dissertation,  The  Johns 
Hopkins  University,  in  preparation. 


8.  L.  Valiant,  "On  Non-line&r  Lower  Bounds  in 
Computational  Complexity,"  Proc.  7th  Ann.  ACM 
Sycp.  on  Theory  of  Computing.  Albuquerque,  N.M. 
1975. 


